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COMBINATION APPROACHES FOR GENERATING IMMUNE 
RESPONSES 



5 Technical Field 

The present invention relates to compositions comprising a polynucleotide 
component and a polypeptide component that can be used for the generation of 
immune responses in a subject. In one aspect, the compositions of the present 

invention are used in methods to generate immune responses in subjects to which the 
10 compositions are administered. In another aspect, the compositions of the present 
invention are used in methods of generating neutrahzing activity against multiple 
subtypes, serotypes, or strains of a selected microorganims , for example, viruses (e.g., 
Human Immunodeficiency Virus (HIV)). 

1 5 Background of the Invention 

Acquired immune deficiency syndrome (AIDS) is recognized as one of the 
greatest health threats facing modem medicine. There is, as yet, no cure for this 
disease. 

In 1983-1984, three groups independently identified the suspected etiological 
20 agent of AIDS. See, e.g., Barre-Sinoussi et al. (1983) Science 220:868-871; 

Montagnier et al., in Human T-Cell Leukemia Viruses (Gallo, Essex & Gross, eds., 
1984); Vilmer et al. (1984) The Lancet 1:753; Popovic et al. (1984) Science 224:497- 
500; Levy et al. (1984) Science 225:840-842. These isolates were variously called 
lyraphadenopathy-associated virus (LAV), human T-cell lymphotropic virus type III 
25 (HTLV-m), or AIDS-associated retrovirus (ARV). All of these isolates are strains of 
the same virus, and were later collectively named Human Immunodeficiency Virus 



1 



PP209 12.001 
PATENT 



(HTV). With the isolation of a related AIDS-causing virus, the strains originally called 
HIV are now termed HIV-1 and the related virus is called HIV-2 See, e.g., Guyader et 
al. (1987) Nature 326:662-669; Bnm-Vezinet et al. (1986) Science 233:343-346; 
Clavel et al. (1986) Nature 324:691-695. 
5 A great deal of information has been gathered about the HIV virus; however, 

to date an effective vaccine has not been identified. Several targets for vaccine 
development have been examined including the env and Gag gene products encoded 
by HIV. Gag gene products include, but are not limited to, Gag-polymerase and Gag- 
protease. Env gene products include, but are not limited to, monomeric gpl20 

1 0 polypeptides, oligomeric gp 1 40 polypeptides and gp 1 60 polypeptides. 

Haas, et al., {Current Biology 6(3):3 15-324, 1996) suggested that selective 
codon usage by HIV- 1 appeared to account for a substantial fraction of the 
inefficiency ofviral protein synthesis. Andre, et al., (J. Firo/. 72(2): 1497-1503, 1998) 
described an increased immune response elicited by DNA vaccination employing a 

1 5 synthetic gp 1 20 sequence with modified codon usage. Schneider, et al., (J Virol. 
71(7):4892-4903, 1997) discuss inactivation of inhibitory (or instability) elements 
(INS) located within the coding sequences of the Gag and Gag-protease coding 
sequences. 

The Gag proteins of HIV-1 are necessary for the assembly of virus-like 
20 particles. HIV- 1 Gag proteins are involved in many stages of the life cycle of the virus 
including, assembly, virion maturation after particle release, and early post-entry steps 
in virus replication. The roles of HIV-1 Gag proteins are numerous and complex 
(Freed, E.G., Virology 25\:l-l5, 1998). 

Wolf, et al, (PCX International Publication No. WO 96/30523, published 3 
25 October 1996; European Patent Application, Publication No. 0 449 1 16 Al, published 
2 October 1991) have described the use of altered pr55 Gag of HIV-1 to act as a non- 
infectious retroviral-like particulate carrier, in particular, for the presentation of 
immunologically important epitopes. Wang, et al., {Virology 200:524-534, 1994) 
describe a system to shxdy assembly of HIV Gag-beta-galactosidase fusion proteins 
30 into virions. They describe the construction of sequences encoding HIV Gag-beta- 
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galactosidase fusion proteins, the expression of such sequences in the presence of HIV 

Gag proteins, and assembly of these proteins into virus particles. 

Shiver, et al., (PCT Memational Publication No. WO 98/34640, published 13 

August 1998) described altering HIV-1 (CAMl) Gag coding sequences to produce 
5 synthetic DNA molecules encoding HTV Gag and modifications of HIV Gag. The 

codons of the synthetic molecules were codons preferred by a projected host cell. 

Recently, use of HIV Env polypeptides in immunogenic compositions has been 

described, (see, U.S. Patent No. 5,846,546 to Hurwitz et al., issued December 8, 1998, 

describing immunogenic compositions comprising a mixture of at least four different 
10 recombinant virus that each express a different HIV env variant; and U.S. Patent No. 

5,840,313 to Vahlne et al., issued November 24, 1998, describing peptides which 

correspond to epitopes of the HIV-1 gpl20 protein). In addition, U.S. Patent No. 

5,876,731 to Sia et al, issued March 2, 1999 describes candidate vaccines against HIV 

comprising an amino acid sequence of a T-cell epitope of Gag linked directly to an 
1 5 amino acid sequence of a B-cell epitope of the V3 loop protein of an HIV- 1 isolate 

containing the sequence GPGR. 

PCT International Publication Nos. WO/00/39302; WO/00/39303; 

WO/00/39304; WO/02/04493; WO/03/004657; WO/03/004620; and WO/03/020876 

described a number of codon-optimized HIV polypeptides, as well as some native HIV 
20 sequences. Further, a variety of HIV polypeptides comprising mutations were 

described. The use of these HIV polypeptides in vaccine compositions and methods 

of immunization were also described. 

The present invention provides improved compositions and methods for 

generating immune responses against multiple subtypes, serotypes, or strains of a 
25 selected microorganism , for example, a virus (e.g., HIV-1). 

Summary of the Invention 

The present invention relates to compositions and methods for their use for 
generating an immune response in a subject. The compositions of the invention 

30 comprise at least two components wherein each component provides a different but 
analogous polypeptide immunogen. The polypeptide immunogen is provided either 



3 



PP20912.001 
PATENT 



directly in the form of a polypeptide (including polypeptide fragments, modified 
forms, encapsulated forms, etc.) or indirectly as a polynucleotide immunogen 
(including DNA and/or RNA encoding a polypeptide immunogen). The compositions 
of the present invention may be used in methods to generate immune responses in 
5 subjects to which the compositions are administered, wherein the immune response is 
directed against multiple subtypes, serotypes, or strains of a selected microorganims , 
for example, viruses (e.g., Human Immunodeficiency Virus (HIV)). In a preferred 
embodiment, the present invention relates to compositions comprising a 
polynucleotide component and a polypeptide component that can be used for the 

10 generation of immune responses in a subject, for example, the generation of 

neutralizing antibodies. Other embodiments comprising at least two polynucleotide 
components each providing a different but analogous polypeptide immunogen, or 
embodiments comprising at least two polypeptide components each providing a 
different but analogous polypeptide immunogen are also contemplated. 

15 hi a first aspect, the present invention includes a composition for generating an 

immune response in a mammal. These compositions typically comprise a 
polynucleotide component consisting essentially of one polynucleotide encoding an 
HIV immunogenic polypeptide, and a polypeptide component comprising one or more 
HIV immunogenic polypeptides analogous to the polypeptide encoded by the 

20 polynucleotide component, with the proviso that at least one HFV immunogenic 
polypeptide of the polypeptide component is derived fi-om a different HIV subtype 
than the subtype from which the immunogenic polypeptide encoded by the 
polynucleotide component is derived. In other words, in this first aspect, the HIV 
immunogenic polypeptide encoded by the polynucleotide component and the 

25 analogous HIV immunogenic polypeptide comprising the polypeptide component are 
derived firom different subtypes of HIV. 

In a second aspect, the present invention includes compositions for generating 
an immune response in a mammal. These compositions typically comprise a 
polynucleotide component comprising two or more polynucleotide sequences 

30 comprising coding sequences for two or more analogous HIV immunogenic 

polypeptides, wherein the coding sequences for at least two of the HIV immunogenic 
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polypeptides are derived from different HIV subtypes, and a polypeptide component 
comprising one or more HIV immunogenic polypeptides analogous to the polypeptide 
encoded by the polynucleotide component, with the proviso that if the polypeptide 
component comprises the same number or greater than the number of analogous HIV 
5 immunogenic polypeptides encoded by the polynucleotide component, then at least 
one of the HIV immunogenic polypeptides of the polypeptide composition is derived 
from a different HIV subtype than the HIV immunogenic polypeptides provided by 
the polynucleotide component. 

The polynucleotide components of both of these aspects may comprises at 

1 0 least one polynucleotide that is a native polynucleotide. Alternately, or in addition, 
the polynucleotide components may comprise at least one polynucleotide that is a 
synthetic polynucleotide. Synthetic polynucleotides may comprise codons optimized 
for expression in mammalian cells (e.g., human cells). The polynucleotide component 
may comprise a single polynucleotide molecule, or two or more different 

1 5 polynucleotide molecules, each encoding one or more HIV polypeptides. The 
polynucleotide component may comprise DNA or RNA or both. 

The HIV immunogenic polypeptides (encoded by the polynucleotide 
component and/or those which comprise the polypeptide component) may be HIV 
envelope polypeptides. The HIV polypeptides may comprises one or more mutations 

20 compared to the wild-type HIV polypeptide (e.g., in the case of envelope proteins, at 
least one of the envelope polypeptides may comprise a mutation in the cleavage site 
or a mutation in the glycosylation site, a deletion or modification of the VI region, a 
deletion or modification of the V2 region, a deletion or modification of the V3 region, 
modifications to expose an envelope binding region that binds to a CCR5 chemokine 

25 co-receptor, and combinations thereof). Other immunogenic HIV polypeptides may 
include, but are not limited to, Gag, Env, Pol, Prot, Int, RT, vif, vpr, vpu, tat, rev, and 
nef polypeptides. 

The subtypes from which the HIV immunogenic polypeptides and coding 
sequences therefore may be selected mclude, but are not limited to. Subtype A, 
30 Subtype B, Subtype C, Subtype D, Subtype E, Subtype F, Subtype G, Subtype H, 
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Subtype I, Subtype J, Subtype K, Subtype N and Subtype 0, as well as any of the 
identified CRFs. 

In addition to immunogenic HIV polypeptides and sequences encoding same, 
the polynucleotide component may encode and the polypeptide component may 
5 comprise one or more additional antigenic polypeptides which may include antigenic 
polypeptides not derived from HIV-1 coding sequences. 

The polynucleotide component may further comprise sequences encoding one 
or more control elements compatible with expression in a selected host cell, wherein 
•the control elements are operable linked to polynucleotides encoding HIV 

1 D immunogenic polypeptides. Exemplary control elements include, but are not limited 
to, a transcription promoter (e.g., CMV, CMV+intron A, SV40, RSV, HIV-Ltr, 
MMLV-ltr, and metallothionein), a transcription enhancer element, a transcription 
termination signal, polyadenylation sequences, sequences for optimization of initiation 
of translation, internal ribosome entry sites (e.g., ECMV IRES) and translation 

15 termination sequences. 

The polynucleotide component may comprise further components as described 
herein (e.g., carriers, vector sequences, control sequences, etc.). The polypeptide 
component may comprise further components as described herein (e.g., carriers, 
adjuvants, immunoenhancers, etc.)- 

20 The present invention also includes methods of generating an immune 

response in a subject. In one embodiment of the method, a composition for generating 
an immune response in a mammal of the present invention, for example, as described 
above, is provided. One or more gene dehvery vectors comprising the polynucleotides 
of the polynucleotide component of the composition are administered to the subject 

25 under conditions that are compatible with expression of the polynucleotides in the 
subject for the production of encoded HIV immunogenic polypeptides. Further, the 
polypeptide component of the composition for generating an immune response is 
administered to the subject. 

The one or more gene delivery vectors and the polypeptide component may be 

30 administered, for example, concurrently or sequentially. 
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The polynucleotide component may comprise further components as described 
herein (e.g., carriers, vector sequences, control sequences, etc.). The polypeptide 
component may comprise further components as described herein (e.g., carriers, 
adjuvants, immunoenhancers, etc.). 
5 The one or more gene dehvery vectors may comprise, for example, nonviral 

and/or viral vectors. Exemplary non-viral vectors include, but are not limited to 
plasmids or expression cassettes. Exemplary viral vectors include, but are not hmited 
to retroviral, lentiviral, alphaviral, poxviral, herpes viral, adeno-associated viral, 
polioviral, measles viral, adenoviral vectors, or other known viral vectors. The one or 

10 more gene delivery vectors may be delivered using a particulate carrier, for example, 
coated on a gold or tungsten particle and the coated particle may be delivered to the 
subject using a gene gun, or PLG particles delivered by electroporation or otherwise. 
Alternatively, the one or more gene dehvery vectors are encapsulated in a liposome 
preparation. The one or more gene delivery vectors may be administered, for 

15 example, intramuscularly, intramucosally, intranasally, subcutaneously, intradermally, 
transdermally, intravaginally, intrarectally, orally, intravenously, or by combinations 
of these methods. 

The subjects of the methods of the present invention are typically marmnals, 
for example, humans. 

20 The immune response generated by the methods of the present invention may 

be humoral and/or cellular. In one embodiment, the immune response results in 
generating neutralizing antibodies against multiple HIV-subtypes in the subject. 

These and other embodiments of the present invention will readily occur to 
those of ordinary skill in the art in view of the disclosure herein. 

25 

Brief description of the Figures 

Figures 1 A to ID depict the nucleotide sequence of HIV Subtype C 
8_5_TV1_C.ZA (SEQ ID N0:1; refened to herein as TVl). Various regions are 
shown in Table 1. 

30 Figures 2 A-2E depicts an alignment of Env polypeptides from various HIV 

isolates (Subtype B-SF162, Subtype C-TV1.8_2, Subtype C-TV1.8_5, Subtype C- 
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TV2.12-5yi, Subtype C-MJ4, India Subtype C-93IN101, Subtype A-Q2317, Subtype 
D-92UG001, Subtype E-cm235, and a Consensus Sequence). The arrows indicate 
exemplary regions for deletions and/or truncations in the beta and/or bridging sheet 
region(s). The "*" denotes N-linked glycosylation sites, one or more of which can be 
5 modified {e.g., deleted and/or mutated; one such possible mutation is mutation (N-^ 
Q)). 

Figure 3 presents a schematic diagram showing the relationships between the 
following forms of the HIV Env polypeptide: gpl60, gpl40, gpl20, and gp41 . 

Figure 4 presents neutraUzing antibody activity data against HW-l subtype B 
10 strain SF162 obtained from a number of different immunization protocols in rabbits. 

Figure 5 presents neutralizing antibody activity data against HIV-1 subtype C 
strain TVl obtained from a number of different immunization protocols in rabbits. 

Figure 6 presents the nucleotide sequence of the polynucleotide designated 
gpl40.modSF162.delV2. 
1 5 Figure 7 presents the nucleotide sequence of the polynucleotide designated 

gpl40.mut7.modSF162.delV2. 

Figure 8 presents the nucleotide sequence of the polynucleotide designated 
gpl40mod.TVl.delV2. 

Figure 9 presents the nucleotide sequence of the polynucleotide designated 
20 gpl40mod.TVl.mut7.delV2. 

Figure 10 presents the nucleotide sequence of the polynucleotide designated 
gpl60mod.Q23-17 (optimized sequence based on Subtype A HIV-1 isolate Q23-17 
from Kenya GenBank Accession AF004885). 

Figure 1 1 presents the nucleotide sequence of the polynucleotide designated 
25 gpl60mod.98UA01 16 (optimized sequence based on Subtype A HIV-1 isolate 
98UA0116 from Ukraine GenBank Accession AF413987). 

Figure 12 presents the nucleotide sequence of the polynucleotide designated 
gpl60mod.SE8538 (optimized sequence based on Subtype A HIV-1 isolate SE8538 
from Tanzania GenBank Accession AF069669). 
30 Figure 13 presents the nucleotide sequence of the polynucleotide designated 

gpl60mod.UG031 (optimized sequence based on Subtype A Human 
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immunodeficiency virus 1 proviralDNA, complete genome, clone :pUG031-Al 
GenBank Accession AB098330). 

Figure 14 presents the nucleotide sequence of the polynucleotide designated 
ajl60mod.92UG001 (optimized sequence based on Subtype D Human 
5 immunodeficiency virus type 1 complete proviral genome, strain 92UG001 GenBank 
Accession AJ320484). 

Figure 15 presents the nucleotide sequence of the polynucleotide designated 
gp 1 60mod.94UGl 14 (optimized sequence based on Subtype D HIV- 1 isolate 
94UG1 14 from Uganda GenBank Accession U88824). 
1 0 Figure 1 6 presents the nucleotide sequence of the polynucleotide designated 

gpl60mod.ELI (optimized sequence based on Subtype D Human immunodeficiency 
virus type 1, isolate ELIGenBank Accession K03454). 

Figure 17 presents the nucleotide sequence of the polynucleotide designated 
gpl60mod.93IN101 (optimized sequence based on Indian Subtype C Human 
1 5 immunodeficiency virus type 1 subtype C genomic RNA GenBank Accession 
AB023804). 

Figure 18 presents the nucleotide sequence of the polynucleotide designated 
gpl60mod.cm235.V3con (optimized sequence based on Subtype E HIV-1 isolate). 

Figure 19 presents the nucleotide sequence of the polynucleotide designated 
20 gpl 60partialmod.cm235.V3 con (optimized sequence based on Subtype E HIV-1 
isolate). 

Detailed Description of the Invention 

The practice of the present invention will employ, unless otherwise indicated, 
25 conventional methods of chemistry, biochemistry, molecular biology, immunology 

and pharmacology, within the skill of the art. Such techniques are explained fully in 
the literature. See, e.g.. Remington 's Pharmaceutical Sciences, 18th Edition (Easton, 
Pennsylvania: Mack Publishing Company, 1990); Methods In Enzymology (S. 
Colowick and N. Kaplan, eds.. Academic Press, Inc.); and Handbook of Experimental 
30 Immunology, Vols. I-IV (D.M. Weir and C.C. Blackwell, eds., 1986, Blackwell 

Scientific Publications); Sambrook, et al.. Molecular Cloning: A Laboratory Manual 
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(2nd Edition, 1989); Short Protocols in Molecular Biology, 4th ed. (Ausubel et al. 
eds., 1999, John Wiley & Sons); Molecular Biology Techniques: An Intensive 
Laboratory Course, (Ream et al, eds., 1998, Academic Press); PCR (Introduction to 
Biotechniques Series), 2nd ed. (Newton & Graham eds., 1997, Springer Verlag). 
5 All patents, publications, sequence citations, and patent applications cited in 

this specification are herein incorporated by reference as if each individual patent, 
publication, sequence citation, or patent application was specifically and individually 
indicated to be incorporated by reference in its entirety for all purposes. 

As used in this specification, the singular forms "a," "an" and "the" include 
1 0 plural references unless the content clearly dictates otherwise. Thus, for example, 
reference to "an antigen" includes a mixture of two or more such agents. 

1.0.0 Definitions 

In describing the present invention, the following terms will be employed, and 

1 5 are intended to be defined as indicated below. 

"Synthetic" sequences, as used herein, refers to HIV polypeptide-encoding 
polynucleotides whose expression has been modified as described herein, for example, 
by codon substitution, altered activities, and/or inactivation of inhibitory sequences. 
"Wild-type" or "native" sequences, as used herein, refer to polypeptide-encoding 

20 polynucleotides that are substantially as they are found in nature, e.g., Gag, Pol, Vif, 
Vpr, Tat, Rev, Vpu, Env and/or Nef encoding sequences as found in HIV isolates, e.g., 
SF162, SF2, AF110965, AF110967, AFl 10968, AFl 10975, MJ4 (asubtype C, 
Ndung'u et al. (2001) J. Virol. 75:4964-4972), Subtype B-SF162, Subtype C-TV1.8_2 
(8_2_TV1_C.ZA), Subtype C-TV1.8_5 (8_5_TV1_C.ZA), Subtype C-TV2.12-5/1 

25 (12-5_1_TV2_C.ZA), Subtype C-MJ4, India Subtype C-93IN101, Subtype A-Q2317, 
Subtype D-92UG001, Subtype E-cm235, Subtype A HIV-1 isolate Q23-17 from 
Kenya GenBank Accession AF004885, Subtype A HIV-l isolate 98UA0116 from 
Ukraine GenBank Accession AF413987, Subtype A HIV-1 isolate SE8538 from 
Tanzania GenBank Accession AF069669, Subtype A Human iirununodeficiency virus 

30 1 proviral DNA, complete genome, clone:pUG03 1 -Al GenBank Accession 

AB098330, Subtype D Human immunodeficiency virus type 1 complete proviral 
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genome, strain 92UG001 GenBank Accession AJ320484, Subtype D HIV-l isolate 
94UG1 14 from Uganda GenBank Accession U88824, Subtype D Human 
immunodeficiency virus type 1, isolate ELIGenBank Accession K03454, and Indian 
Subtype C Human immunodeficiency virus type 1 subtype C genomic RNA GenBank 
5 Accession AB023804. 

The various regions of the HIV genome are shown in Table 1, with numbering 
relative to 8_5_TV1_C.ZA (Figures 1-A-lD). Thus, the term "Pol" refers to one or 
more of the following polypeptides: polymerase (p6Pol); protease (prot); reverse 
transcriptase (p66RT or RT); RNAseH (plSRNAseH); and/or integrase (pSlInt or Int). 

10 Identification of gene regions for any selected HTV isolate (e.g., strains within a 
subtype, or strains derived from different subtypes) can be performed by one of 
ordinary skill in the art based on the teachings presented herein and the information 
known in the art, for example, by performing nucleotide and/or polypeptide 
alignments relative to 8_5_TV1_C.ZA (polynucleotide sequence presented in Figures 

15 1 A- ID) or alignment to other known HIV isolates, for example, Subtype B isolates 
with gene regions (e.g., SF2, GenBank Accession number K02007; SF162, GenBank 
Accession Number M38428) and Subtype C isolates with gene regions (e.g., GenBank 
Accession Number AFl 10965 and GenBank Accession Number AFl 10975). 
As used herein, the term "virus-like particle" or "VLP" refers to a 

20 nonreplicating, viral shell, derived from any of several viruses discussed further 
below. VLPs are generally composed of one or more viral proteins, such as, but not 
limited to those proteins referred to as capsid, coat, shell, surface and/or envelope 
proteins, or particle-forming polypeptides derived from these proteins. VLPs can form 
spontaneously upon recombinant expression of the protein in an appropriate 

25 expression system. Methods for producing particular VLPs are known in the art and 
discussed more fiilly below. The presence of VLPs following recombinant expression 
of viral proteins can be detected using conventional techniques known in the art, such 
as by electron microscopy. X-ray crystallography, and the like. See, e.g., Baker et al., 
Biophys. J. (1991) 60:1445-1456; Hagensee et al., J. Virol. (1994) 68:4503-4505. For 

30 example, VLPs can be isolated by density gradient centrifiigation and/or identified by 
characteristic density banding. Alternatively, cryoelectron microscopy can be 
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performed on vitrified aqueous samples of the VLP preparation in question, and 
images recorded under appropriate exposure conditions. 

By "particle-forming polypeptide" derived from a particular viral protein is 
meant a full-length or near full-length viral protein, as well as a fragment thereof, or a 
viral protein with internal deletions, which has the ability to form VLPs under 
conditions that favor VLP formation. Accordingly, the polypeptide may comprise the 
full-length sequence, fragments, truncated and partial sequences, as well as analogs 
and precursor forms of the reference molecule. The term therefore intends deletions, 
additions and substitutions to the sequence, so long as the polypeptide retains the 
ability to form a VLP. Thus, the term includes natural variations of the specified 
polypeptide since variations in coat proteins often occur between viral isolates. The 
term also includes deletions, additions and substitutions that do not naturally occur in 
the reference protein, so long as the protein retains the ability to form a VLP. 
Preferred substitutions are those which are conservative in nature, i.e., those 
substitutions that take place within a family of amino acids that are related in their side 
chains. Specifically, amino acids are generally divided into four families: (1) acidic ~ 
aspartate and glutamate; (2) basic ~ lysine, arginine, histidine; (3) non-polar ~ 
alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan; 
and (4) uncharged polar - glycine, asparagine, glutamine, cystine, serine threonine, 
tyrosine. Phenylalanine, tryptophan, and tyrosine are sometimes classified as aromatic 
amino acids. 

The term "HIV polypeptide" refers to any amino acid sequence that exhibits 
sequence homology to native HIV polypeptides (e.g.. Gag, Env, Prot, Pol, RT, Int, vif, 
vpr, vpu, tat, rev, nef and/or combinations thereof) and/or which is functional. Non- 
limiting examples of fimctions that maybe exhibited by HIV polypeptides include, use 
as immunogens {e.g., to generate a humoral and/or cellular immune response), use in 
diagnostics (e.g, bound by suitable antibodies for use in ELIS As or other 
immunoassays) and/or polypeptides which exhibit one or more biological activities 
associated with the wild type or synthetic HIV polypeptide. For example, as used 
herein, the term "Gag polypeptide" may refer to a polypeptide that is bound by one or 
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more anti-Gag antibodies; elicits a humoral and/or cellular immune response; and/or 
exhibits the ability to form particles. 

An "antigen" refers to a molecule containing one or more epitopes (either 
linear, conformational or both) that will stimulate a host's immune system to make a 
5 humoral and/or cellular antigen-specific response. The term is used interchangeably 
with the term "immunogen." Normally, a B-cell epitope will include at least about 5 
amino acids but can be as small as 3-4 amino acids. A T-cell epitope, such as a CTL 
epitope, will include at least about 7-9 amino acids, and a helper T-cell epitope at least 
about 12-20 amino acids. Normally, an epitope will include between about 7 and 15 

10 amino acids, such as, 9, 10, 12 or 15 amino acids. The term "antigen" denotes both 
subunit antigens, (i.e., antigens which are separate and discrete from a whole organism 
with which the antigen is associated in nature), as well as, killed, attenuated or 
inactivated bacteria, viruses, fungi, parasites or other microbes. Antibodies such as 
anti-idiotype antibodies, or fragments thereof, and synthetic peptide mimotopes, which 

15 can mimic an antigen or antigenic determinant, are also captured under the definition 
of antigen as used herein. Similarly, an oligonucleotide or polynucleotide which 
expresses an antigen or antigenic determinant in vivo, such as in gene therapy and 
DNA immunization applications, is also included in the definition of antigen herein. 
For purposes of the present invention, antigens (e.g., polynucleotide encoding 

20 antigens, or polypeptides comprising antigens) can be derived from any 

microorganism having more than one subtype, serotype, or strain variation (e.g., 
viruses, bacteria, parasites, fimgi, etc.). The term also intends any of the various 
tumor antigens. Furthermore, for purposes of the present invention, an "antigen" 
refers to a protein which includes modifications, such as deletions, additions and 

25 substitutions (generally conservative in nature), to the native sequence, so long as the 
protein maintains the ability to elicit an immunological response, as defined herein. 
These modifications may be deliberate, as through site-directed mutagenesis, or may 
be accidental, such as through mutations of hosts which produce the antigens. 

An "immunological response" to an antigen or composition is the development 

30 in a subject of a humoral and/or a cellular immune response to an antigen present in 
the composition of interest. For purposes of the present invention, a "humoral 
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immune response" refers to an immune response mediated by antibody molecules, 
while a "cellular immune response" is one mediated by T-lymphocytes and/or other 
white blood cells. One important aspect of cellular immunity involves an antigen- 
specific response by cytolytic T-cells ("CTL"s). CTLs have specificity for peptide 
5 antigens that are presented in association with proteins encoded by the major 

histocompatibihty complex (MHC) and expressed on the surfaces of cells. CTLs help 
induce and promote the destruction of intracellular microbes, or the lysis of cells 
infected with such microbes. Another aspect of cellular immunity involves an 
antigen-specific response by helper T-cells. Helper T-cells act to help stimulate the 

10 function, and focus the activity of, nonspecific effector cells against cells displaying 
peptide antigens in association with MHC molecules on their surface. A "cellular 
immune response" also refers to the production of cytokines, chemokines and other 
such molecules produced by activated T-cells and/or other white blood cells, including 
those derived from CD4+ and CD8+ T-cells. 

15 A composition or vaccine that elicits a cellular immime response may serve to 

sensitize a vertebrate subject by the presentation of antigen in association with MHC 
molecules at the cell surface. The cell-mediated immune response is directed at, or 
near, cells presenting antigen at their surface. In addition, antigen-specific T- 
lymphocytes can be generated to allow for the future protection of an immunized host. 

20 The ability of a particular antigen to stimulate a cell-mediated immunological 

response may be determined by a number of assays, such as by lymphoproliferation 
(lymphocyte activation) assays, CTL cytotoxic cell assays, or by assaying for T- . 
lymphocytes specific for the antigen in a sensitized subject. Such assays are well 
known in the art. See, e.g., Erickson et al., J. Immunol. (1993) 151:4189-4199; Doe et 

25 al., Eur. J. Immunol. (1994) 24:2369-2376. Recent methods of measuring cell- 
mediated immune response include measurement of intracellular cytokines or cytokine 
secretion by T-cell populations, or by measurement of epitope specific T-cells (e.g., by 
the tetramer technique)(reviewed by McMichael, A.J., and O'Callaghan, C.A., J. Exp. 
Med. 187(9)1367-1371, 1998; Mcheyzer-WiUiams, M.G., et al, Immunol. Rev. 150:5- 

30 21, 1996; Lalvani, A., et al, J. Exp. Med. 186:859-865, 1997). 
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Thus, an immunological response as used herein may be one that stimulates the 
production of antibodies (e.g., neutralizing antibodies that block bacterial toxins and 
pathogens such as viruses entering cells and replicating by binding to toxins and 
pathogens, typically protecting cells from infection and destruction). The antigen of 
5 interest may also elicit production of CTLs. Hence, an immunological response may 
include one or more of the following effects: the production of antibodies by B-cells; 
and/or the activation of suppressor T-cells and/or memory/effector T-cells directed 
specifically to an antigen or antigens present in the composition or vaccine of interest. 
These responses may serve to neutralize infectivity, and/or mediate antibody- 
1 0 complement, or antibody dependent cell cytotoxicity (ADCC) to provide protection to 
an immunized host. Such responses can be determined using standard immunoassays 
and neutralization assays, well known in the art. (See, e.g., Montefiori et al. (1988) J. 
Clin Microbiol. 26:231-235; Dreyer et al. (1999) AIDS Res Hum Retroviruses (1999) 
15(17):1563-1571). 

1 5 An "immunogenic HIV polypeptide" is a polypeptide capable of dieting an 

immune response against one or more native HIV polypeptides, when the 
immunogenic polypeptide is adminstered to a laboratory test animal (such as a mouse, 
guina pig, rhesus macaque, chimpanzee, baboon, etc.). 

An "immunogenic composition" is a composition that comprises an antigenic 

20 molecule where administration of the composition to a subject results in the 

development in the subject of a humoral and/or a cellular immune response to the 
antigenic molecule of interest. The immunogenic composition can be introduced 
directly into a recipient subject, such as by injection, inhalation, oral, intranasal and 
mucosal (e.g., intra-rectally or intra-vaginally) administration. 

25 By "subunit vaccine" is meant a vaccine composition which includes one or 

more selected antigens but not all antigens, derived from or homologous to, an antigen 
from a pathogen of interest such as from a virus, bacterium, parasite or fimgus. Such a 
composition is substantially free of intact pathogen cells or pathogenic particles, or the 
lysate of such cells or particles. Thus, a "subunit vaccine" can be prepared from at 

30 least partially purified (preferably substantially purified) immunogenic polypeptides 
from the pathogen, or analogs thereof The method of obtaining an antigen included 
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in the subimit vaccine can thus include standard purification techniques, recombinant 
production, or synthetic production. 

"Substantially purified" general refers to isolation of a substance (compound, 
polynucleotide, protein, polypeptide, polypeptide composition) such that the substance 

5 comprises the majority percent of the sample in which it resides. Typically in a sample 
a substantially purified component comprises 50%, preferably 80%-85%, more 
preferably 90-95% of the sample. Techniques for purifying polynucleotides and 
polypeptides of interest are well-known in the art and include, for example, ion- 
exchange chromatography, affinity chromatography and sedimentation according to 

10 density. 

A "polynucleotide coding sequence" or a polynucleotide sequence that 
"encodes" a selected polypeptide, is a nucleic acid molecule that is transcribed (in the 
case of DNA) and translated (in the case of mRNA) into a polypeptide in vivo when 
placed under the control of appropriate regulatory sequences (or "control elements"). 

1 5 The boundaries of the coding sequence are determined by a start codon, for example, 
at or near the 5' terminus and a translation stop codon, for example, at or near the 3' 
terminus. A coding sequence can include, but is not limited to, cDNA fix)m viral, 
procaryotic or eucaryotic mRNA, genomic DNA sequences firom viral or procaryotic 
DNA, and even synthetic DNA sequences. Exemplary coding sequences are codon 

20 optimized viral polypeptide-coding sequences used in the present invention. The 
coding regions of the polynucleotide sequences of the present invention are 
identifiable by one of skill in the art and may, for example, be easily identified by 
performing translations of all three frames of the polynucleotide and identifying the 
fi-ame corresponding to the encoded polypeptide, for example, a synthetic nef 

25 polynucleotide of the present invention encodes a nef-derived polypeptide. A 
transcription termination sequence may be located 3' to the coding sequence. 

Typical "control elements", include, but are not limited to, transcription 
regulators, such as promoters, transcription enhancer elements, transcription 
termination signals, and polyadenylation sequences; and translation regulators, such as 

30 sequences for optimization of initiation of translation, e.g., Shine-Dalgamo (ribosome 
binding site) sequences, internal ribosome entry sites (IRES) such as the ECMV IRES, 
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Kozak-type sequences (i.e., sequences for the optimization of translation, located, for 
example, 5' to the coding sequence, e.g., GCCACC placed in front (5') of an initiating 
ATG), leader sequences, translation initiation codon (e.g., ATG), and translation 
termination sequences {e.g., TAA or, preferably, TAAA placed after (3') the coding 
5 sequence). In certain embodiments, one or more translation regulation or initiation 
sequences {e.g., the leader sequence) are derived from wild-type translation initiation 
sequences, i.e., sequences that regulate translation of the coding region in their native 
state. Wild-type leader sequences that have been modified, using the methods 
described herein, also find use in the present invention. Promoters can include 
10 inducible promoters (where expression of a polynucleotide sequence operably linked 
to the promoter is induced by an analyte, cofactor, regulatory protein, etc.), repressible 
promoters (where expression of a polynucleotide sequence operably linked to the 
promoter is induced by an analyte, cofactor, regulatory protein, etc.), and constitutive 
promoters. 

1 5 A "nucleic acid" molecule or "polynucleotide" can include, but is not limited 

to, procaryotic sequences, eucaryotic mRNA, cDNA from eucaryotic mRNA, genomic 
DNA sequences from eucaryotic (e.g., mammalian) DNA, and even synthetic DNA 
sequences. The term also captures sequences that include any of the known base 
analogs of DNA and RNA. In referring to the polynucleotide of the invention, in 

20 those examples in which "DNA" is specifically recited, it will be apparent that for 
many such embodiments, RNA is likewise intended. 

"Operably linked" refers to an arrangement of elements wherein the 
components so described are configured so as to perform their usual fimction. Thus, a 
given promoter operably linked to a coding sequence is capable of effecting the 

25 expression of the coding sequence when the proper enzymes are present. The 

promoter need not be contiguous with the coding sequence, so long as it fimctions to 
direct the expression thereof Thus, for example, intervening untranslated yet 
transcribed sequences can be present between the promoter sequence and the coding 
sequence and the promoter sequence can still be considered "operably linked" to the 

30 coding sequence. 
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"Recombinant" as used herein to describe a nucleic acid molecule means a 
polynucleotide of genomic, cDNA, semisynthetic, or synthetic origin which, by virtue 
of its origin or manipulation: (1) is not associated with all or a portion of the 
polynucleotide with which it is associated in nature; and/or (2) is linked to a 
5 polynucleotide other than that to which it is linked in nature. The term "recombinant" 
as us.ed with respect to a protein or polypeptide means a polypeptide produced by 
expression of a recombinant polynucleotide. "Recombinant host cells," "host cells," 
"cells," "cell lines," "cell cultures," and other such terms denoting procaryotic 
microorganisms or eucaryotic cell lines cultured as unicellular entities, are used 

10 interchangeably, and refer to cells which can be, or have been, used as recipients for 
recombinant vectors or other transfer DNA, and include the progeny of the original 
cell which has been transfected. It is understood that the progeny of a single parental 
cell may not necessarily be completely identical in morphology or in genomic or total 
DNA complement to the original parent, due to accidental or deliberate mutation. 

1 5 Progeny of the parental cell which are sufficiently similar to the parent to be 

characterized by the relevant property, such as the presence of a nucleotide sequence 
encoding a desired peptide, are included in the progeny intended by this definition, 
and are covered by the above terms. 

Techniques for determining amino acid sequence "similarity" are well known 

20 in the art. In general, "similarity" means the exact amino acid to amino acid 

comparison of two or more polypeptides at the appropriate place, where amino acids 
are identical or possess similar chemical and/or physical properties such as charge or 
hydrophobicity. A so-termed "percent similarity" then can be determined between the 
compared polypeptide sequences. Techniques for determining nucleic acid and amino 

25 acid sequence identity also are well known in the art and include determining the 
nucleotide sequence of the mRNA for the gene encoding the amino acid sequence 
(usually via a cDNA intermediate) and determining the amino acid sequence encoded 
thereby, and comparing this to a second amino acid sequence. In general, "identity" 
refers to an exact amino acid to amino acid or nucleotide to nucleotide 

30 correspondence of two polypeptide sequences or polynucleotide sequences, 
respectively. 
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Two or more polynucleotide sequences can be compared by determining their 
"percent identity." Two or more amino acid sequences likewise can be compared by 
determining their "percent identity." The percent identity of two sequences, whether 
nucleic acid or peptide sequences, is generally described as the number of exact 
5 matches between two aligned sequences divided by the length of the shorter sequence 
and multiplied by 100. An approximate alignment for nucleic acid sequences is 
provided by the local homology algorithm of Smith and Waterman, Advances in 
Applied Mathematics 2:482-489 (1981). This algorithm can be extended to use with 
peptide sequences using the scoring matrix developed by Dayhoff, Atlas of Protein 

10 Sequences and Structure, M.O. Dayhoff ed., 5 suppl. 3:353-358, National Biomedical 
Research Foundation, Washington, D.C., USA, and normalized by Gribskov, Nucl. 
Acids Res. 14(6):6745-6763 (1986). An implementation of this algorithm for nucleic 
acid and peptide sequences is provided by the Genetics Computer Group (Madison, 
WI) in their BestFit utility application. The default parameters for this method are 

1 5 described in the Wisconsin Sequence Analysis Package Program Manual, Version 8 
(1995) (available from Genetics Computer Group, Madison, WI). Other equally 
suitable programs for calculating the percent identity or similarity between sequences 
are generally known in the art. 

For example, percent identity of a particular nucleotide sequence to a reference 

20 sequence can be determined using the homology algorithm of Smith and Waterman 
with a default scoring table and a gap penalty of six nucleotide positions. Another 
method of establishing percent identity in the context of the present invention is to use 
the MPSRCH package of programs copyrighted by the University of Edinburgh, 
developed by John F. Collins and Shane S. Sturrok, and distributed by IntelliGenetics, 

25 Inc. (Mountain View, GA). From this suite of packages, the Smith- Waterman 

algorithm can be employed where default parameters are used for the scoring table 
(for example, gap open penalty of 12, gap extension penalty of one, and a gap of six). 
From the data generated, the "Match" value reflects "sequence identity." Other 
suitable programs for calculating the percent identity or similarity between sequences 

30 are generally known in the art, such as the alignment program BLAST, which can also 
be used with default parameters. For example, in a preferred embodiment, BLASTN 
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and BLAST? can be used with the following default parameters for nucleic acid 
searches - genetic code = standard; filter = none; strand = both; cutoff = 60; expect = 
10; Matrix = BLOSUM62; Descriptions = 50 sequences; sort by = HIGH SCORE; 
Databases = non-redundant, GenBank + EMBL + DDBJ + PDB + GenBank CDS 
5 translations + Swiss protein + Spupdate + PIR; (ii) polypeptide searches - . Details of 
these programs can be foirad at the following internet address: www.ncbi.nlm.gov/cgi- 
bin/BLAST. 

Protein similarity and percent identity sequence searches can be carried out, 
for example, using Smith- Waterman Similarity Search algorithms (e.g., at 

10 www.ncbi.nlm.gov, or from commercial sources, such as, TimeLogic Corporation, 
Crystal Bay, NV). For example, in a preferred embodiment, the Smith- Waterman 
Similarity Search can be used with default parameters, for example, as follows: 
Weight MATRIX = BLOSUM62.MAA; Gap Opening PENALTY = -12; Gap 
Extension PENALTY = -2; FRAME PENALTY = 0; QUERY FORMAT = 

1 5 FASTA/PEARSON; QUERY TYPE = AA; QUERY SEARCH = 1 ; QUERY SET = 
CGI_ld82ws301bde.seq; TARGET TYPE = AA; TARGET SET = NRPdb gsaa; 
SIGNIFICANCE = GAPPED; MAX SCORES = 30; MAX ALIGNMENTS = 20; 
Reporting THRESHOLD = Score=l; ALIGNMENT THRESHOLD = 20. 

One of skill in the art can readily determine the proper search parameters to 

20 use for a given sequence, exemplary preferred Smith Waterman based parameters are 
presented above. For example, the search parameters may vary based on the size of 
the sequence in question. Thus, for polynucleotide sequences of the present invention 
the length of the polynucleotide sequence disclosed herein is searched against a 
selected database and compared to sequences of essentially the same length to 

25 determine percent identity. For example, a representative embodiment of the present 
invention would include an isolated polynucleotide comprising X contiguous 
nucleotides, wherein (i) the X contiguous nucleotides have at least about a selected 
level of percent identity relative to Y contiguous nucleotides of one or more of the 
sequences described herein or fragment thereof, and (ii) for search purposes X equals 

30 Y, wherein Y is a selected reference polynucleotide of defined length (for example, a 
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length of from 15 nucleotides up to the number of nucleotides present in a selected 
full-length sequence). 

The sequences of the present invention can include fragments of the sequences, 
for example, from about 15 nucleotides up to the number of nucleotides present in the 
5 fiiU-length sequences described herein, including all integer values falling within the 
above-described range. For example, fragments of the polynucleotide sequences of 
the present invention may be 30-60 nucleotides, 60-120 nucleotides, 120-240 
nucleotides, 240-480 nucleotides, 480-1000 nucleotides, and all integer values 
therebetween. 

1 0 The synthetic polynucleotides described herein include related polynucleotide 

sequences having about 80% to 100%, greater than 80-85%, preferably greater than 
90-92%, more preferably greater than 95%, and most preferably greater than 98% up 
to 100% (including all integer values falling within these described ranges) sequence 
identity to the synthetic polynucleotide sequences disclosed herein when the 
1 5 sequences of the present invention are used as the query sequence against, for 
example, a database of sequences. 

Two nucleic acid fragments are considered to "selectively hybridize" as 
described herein. The degree of sequence identity between two nucleic acid molecules 
affects the efficiency and strength of hybridization events between such molecules. A 
20 partially identical nucleic acid sequence will at least partially inhibit a completely 
identical sequence from hybridizing to a target molecule. Inhibition of hybridization 
of the completely identical sequence can be assessed using hybridization assays that 
are well known in the art (e.g., Southern blot, Northern blot, solution hybridization, or 
the like, see Sambrook, et al., supra or Ausubel et al., supra). Such assays can be 
25 conducted using varying degrees of selectivity, for example, using conditions varying 
from low to high stringency. If conditions of low stringency are employed, the 
absence of non-specific binding can be assessed using a secondary probe that lacks 
even a partial degree of sequence identity (for example, a probe having less than about 
30% sequence identity with the target molecule), such that, in the absence of non- 
30 specific binding events, the secondary probe will not hybridize to the target. 
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When utilizing a hybridization-based detection system, a nucleic acid probe is 
chosen that is complementary to a target nucleic acid sequence, and then by selection 
of appropriate conditions the probe and the target sequence "selectively hybridize," or 
bind, to each other to form a hybrid molecule. A nucleic acid molecule that is capable 
5 of hybridizing selectively to a target sequence under "moderately stringent" typically 
hybridizes under conditions that allow detection of a target nucleic acid sequence of at 
least about 10-14 nucleotides in length having at least approximately 70% sequence 
identity with the sequence of the selected nucleic acid probe. Stringent hybridization 
conditions typically allow detection of target nucleic acid sequences of at least about 

10 10-14 nucleotides in length having a sequence identity of greater than about 90-95% 
with the sequence of the selected nucleic acid probe. Hybridization conditions useful 
for probe/target hybridization where the probe and target have a specific degree of 
sequence identity, can be determined as is known in the art (see, for example, Nucleic 
Acid Hybridization: A Practical Approach, editors B.D. Hames and S.J. Higgins, 

1 5 (1 985) Oxford; Washington, DC; IRL Press). 

With respect to stringency conditions for hybridization, it is well known in the 
art that numerous equivalent conditions can be employed to establish a particular 
stringency by varying, for example, the following factors: the length and nature of 
probe and target sequences, base composition of the various sequences, concentrations 

20 of salts and other hybridization solution components, the presence or absence of 
blocking agents in the hybridization solutions (e.g., formamide, dextran sulfate, and 
polyethylene glycol), hybridization reaction temperature and time parameters, as well 
as, varying wash conditions. The selection of a particular set of hybridization 
conditions is selected following standard methods in the art (see, for example, 

25 Sambrook, et al., supra or Ausubel et al., supra). 

A first polynucleotide is "derived from" second polynucleotide if the first 
polynucleotide has the same basepair sequence as a region of the second 
polynucleotide, its cDNA, complements thereof, or if the first polynucleotide displays 
subtantial sequence identity to a region of the second polynucleotide, its cDNA, 

30 complements thereof, wherein sequence identity is determined as described above. 
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Substantial sequence identity is typically about 90% or greater, preferably about 95% 
or greater, more preferably about 98% or greater. 

A first polypeptide is "derived fi-om" a second polypeptide if it is encoded by a 
first polynucleotide derived fi-om a second polynucleotide, or the first polypeptide has 
5 the same amino acid sequence as the second polypeptide or a portion thereof, or the 
first polypeptide displays substantial sequence identity to the second polypeptide or a 
portion thereof, wherein sequence identity is determined as described above. 
Substantial sequence identity is typically about 90% or greater, preferably about 95% 
or greater, more preferably about 98% or greater. 

10 Generally, a viral polypeptide is "derived from" a particular polypeptide of a 

virus (viral polypeptide) if it is (i) encoded by the same open reading fiame of a 
polynucleotide of that virus (viral polynucleotide), or (ii) displays substantial sequence 
identity to a polypeptide of that virus as described above. 

A polypeptide is "derived fi-om" an HIV subtype if it is derived fi-om a 

15 polypeptide present in a member of the subtype, derived fi-om a polypeptide encoded 
by a polynucleotide present in a member of the subtype, encoded by a polynucleotide 
that is derived from a polynucleotide present in a member of the subtype, or derived 
from a polypeptide encoded by a polynucleotide that is derived from a polynucleotide 
present in a member of the subtype. 

20 "Analogous polypeptides" refers to polypeptides that are encoded by, or 

derived from polypeptides encoded by, the same gene of the same organism but from 
different polynucleotide sources. In the context of the present invention, different 
polynucleotide sources could be different subtypes, different serotypes or different 
strains. Thus, for example, a Gag polypeptide from a Subtype B HIV would be an 

25 analogous polypeptide to a Gag polypeptide from a Subtype C HIV, or an envelope 
polypeptide derived from a first HIV-1 subtype, serotype, or strain would be an 
analogous polypeptide to an envelope polypeptide derived from a second HIV-1 
subtype, serotype, or strain. Examples of types of analogous polypeptides that could 
be derived from different HIV-1 subtypes or strains include, the envelope polypeptides 

30 gp4 1 , gp 1 20, gp 1 40, and gp 1 60, ail of which are considered analogous polypeptides. 
Further, such analogous polypeptides may each comprise different alterations or 
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mutations, for example, analogous polypeptides derived from the HIV-l envelope 
gene include, but are not limited to, the following: a gp41 polypeptide, a gpl20 
polypeptide, a gpl40 polypeptide, a gpl60 polypeptide, a gpl40 comprising a deletion 
of a portion of the VI loop, a gpl40 polypeptide comprising a deletion of a portion of 
5 the V2 loop, a gp 140 polypeptide comprising a deletion of a portion of the V3 loop, a 
gpl40 polypeptide with a mutated protease cleavage site, a gpl60 comprising a 
deletion of a portion of the VI loop, a gpl60 polypeptide comprising a deletion of a 
portion of the V2 loop, a gp 160 polypeptide comprising a deletion of a portion of the 
V3 loop, and a gpl60 polypeptide with a mutated protease cleavage site. Thus, for 

1 0 example, a gp 1 60 polypeptide from a Subtype B HIV is an analogous polypeptide to a 
gp 140 polypeptide from a Subtype C HIV. 

A "gene" as used in the context of the present invention is a sequence of 
nucleotides in a genetic nucleic acid (viral genome, chromosome, plasmid, etc.) with 
which a genetic function is associated. A gene is a hereditary unit, for example of an 

15 organism comprising a polynucleotide sequence (e.g., an RNA sequence for HIV-l or 
a proviral HIV-l DNA sequence), that occupies a specific physical location (a "gene 
locus" or "genetic locus") within the genome of an organism. A gene can encode an 
expressed product, such as a polypeptide or a polynucleotide (e.g., tRNA). 
Alternatively, a gene may define a genomic location for a particular event/function, 

20 such as the binding of proteins andyor nucleic acids (e.g., 5' LTR), wherein the gene 
does not encode an expressed product. Examples of HIV-l genes include, but are not 
limited to. Gag, Env, Pol (prot, RNase, Int), tat, rev, nef, vif, vpr, and vpu. A gene 
may include coding sequences, such as, polypeptide encoding sequences, and non- 
coding sequences, such as, promoter sequences, poly-adenylation sequences, 

25 transcriptional regulatory sequences (e.g., enhancer sequences). Many eucaryotic 
genes have "exons" (coding sequences) interrupted by "introns" (non-coding 
sequences). In certain cases, a gene may share sequences with another gene(s) (e.g., 
overlapping genes). It is noted that in the general population, wild-type genes may 
include multiple prevalent versions that contain alterations in sequence relative to each 

30 other. These variations are designated "polymorphisms" or "allelic variations." 



24 



PP20912.001 
PATENT 

"Purified polynucleotide" refers to a polynucleotide of interest or fi-agment 
thereof that is essentially firee, e.g., contains less than about 50%, preferably less than 
about 30%, and more preferably less than about 10%, of the protein with which the 
polynucleotide is naturally associated. Techniques for purifying polynucleotides of 
5 interest are well-known in the art and include, for example, disruption of the cell 
containing the polynucleotide with a chaotropic agent and separation of the 
polynucleotide(s) and proteins by ion-exchange chromatography, affinity 
chromatography and sedimentation according to density. 

By "nucleic acid immunization" is meant the introduction of a nucleic acid 

10 molecule encoding one or more selected antigens into a host cell, for the in vivo 

expression of an antigen, antigens, an epitope, or epitopes. The nucleic acid molecule 
can be introduced directly into a recipient subject, such as by injection, inhalation, 
oral, intranasal and mucosal administration, or the like, or can be introduced ex vivo, 
into cells which have been removed from the host. In the latter case, the transformed 

1 5 cells are reintroduced into the subject where an immune response can be mounted 
against the antigen encoded by the nucleic acid molecule. 

"Gene transfer" or "gene delivery" refers to methods or systems for reUably 
inserting nucleic acid (i.e., DNA or RNA) of interest into a host cell. Such methods 
can result in transient expression of non-integrated transferred DNA, 

20 extrachromosomal replication and expression of transferred repUcons (e.g., episomes), 
or integration of transferred genetic material into the genomic DNA of host cells. 
Gene delivery expression vectors include, but are not limited to, vectors derived from 
adenoviruses, adeno-associated viruses, alphaviruses, herpes viruses, measles viruses, 
polio viruses, pox viruses, vesiculoviruses and vaccinia viruses. When used for 

25 immunization, such gene delivery expression vectors may be referred to as vaccines or 
vaccine vectors. 

The term "transfection" is used to refer to the uptake of foreign DNA by a cell. 
A cell has been "transfected" when exogenous DNA has been introduced inside the 
cell membrane. A number of transfection techniques are generally known in the art. 
30 See, e.g., Graham et al. ( 1 973) Virology, 52:456, Sambrook et al. ( 1 989) Molecular 
Cloning, a laboratory manual. Cold Spring Harbor Laboratories, New York, Davis et 
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al. (1986) Basic Methods in Molecular Biology, Elsevier, and Chu et al. (1981) Gene 
13: 197. Such techniques can be used to introduce one or more exogenous DNA 
moieties into suitable host cells. The term refers to both stable and transient uptake of 
the genetic material, and includes uptake of peptide- or antibody-linked DNAs. 
5 A "vector" is capable of transferring gene sequences to target cells (e.g., viral 

vectors, non-viral vectors, particulate carriers, and liposomes). Typically, "vector 
construct," "expression vector," and "gene transfer vector," mean any nucleic acid 
construct capable of directing the expression of a gene of interest and which can be 
used to transfer gene sequences to target cells. Thus, the term includes cloning and 

10 expression vehicles, as well as viral vectors. 

"Lentiviral vector", and "recombinant lentiviral vector" refer to a nucleic acid 
construct which carries, and within certain embodiments, is capable of directing the 
expression of a nucleic acid molecule of interest. The lentiviral vector include at least 
one transcriptional promoter/enhancer or locus defining element(s), or other elements 

1 5 which control gene expression by other means such as alternate splicing, nuclear RNA 
export, post-translational modification of messenger, or post-transcriptional 
modification of protein. Such vector constructs must also include a packaging signal, 
long terminal repeats (LTRS) or portion thereof, and positive and negative strand 
primer binding sites appropriate to the retrovirus used (if these are not already present 

20 in the retroviral vector). Optionally, the recombinant lentiviral vector may also 

include a signal which directs polyadenylation, selectable markers such as Neo, TK, 
hygromycin, phleomycin, histidinol, or DHFR, as well as one or more restriction sites 
and a translation termination sequence. By way of example, such vectors typically 
include a 5' LTR, a tRNA binding site, a packaging signal, an origin of second strand 

25 DNA synthesis, and a 3 'LTR or a portion thereof 

"Lentiviral vector particle" as utilized within the present invention refers to a 
lentivirus which carries at least one gene of interest. The retrovirus may also contain a 
selectable marker. The recombinant lentivirus is capable of reverse transcribing its 
genetic material (RNA) into DNA and incorporating this genetic material into a host 

30 cell's DNA upon infection. Lentiviral vector particles may have a lentiviral envelope. 
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a non-lentiviral envelope (e.g., an ampho or VSV-G envelope), or a chimeric 
envelope. 

"Alphaviral vector", and "recombinant alphaviral vector" and "alphaviral 
replicon vector" refer to a nucleic acid construct which carries, and within certain 
5 embodiments, is capable of directing the expression of a nucleic acid molecule of 
interest. The alphaviral vector includes at least one transcriptional promoter/enhancer 
or other elements which control gene expression by other means such as alternate 
splicing, nuclear RNA export, post-translational modification of messenger, or post- 
transcriptional modification of protein. Such vector constructs must also include a 

10 packaging signal, and alphaviral replication recognition sequences. Optionally, the 
recombinant alphaviral vector may also include a signal which directs 
polyadenylation, selectable markers such as Neo, TK, hygromycin, phleomycin, 
histidinol, or DHFR, as well as one or more restriction sites and a translation 
termination sequence. Typically, the alphaviral vector will include coding sequences 

1 5 for the alphaviral non-structural proteins, a packaging site, replication recognition 
sequences and a sequence capable of directing the expression of the nucleic acid 
molecule of interest. 

"Expression cassette" refers to an assembly which is capable of directing the 
expression of a sequence or gene of interest. An expression cassette typically includes 

20 a promoter which is operably linked to the polynucleotide sequences or gene(s) of 
interest. Other control elements may be present as well. Expression cassettes 
described herein may be contained within a plasmid construct. In addition to the 
components of the expression cassette, the plasmid construct may also include a 
bacterial origin of replication, one or more selectable markers, a signal which allows 

25 the plasmid construct to exist as single-stranded DNA (e.g., a M13 origin of 

replication), a multiple cloning site, and a "mammalian" origin of replication (e.g., a 
SV40 or adenovirus origin of replication). 

"Packaging cell" refers to a cell that comprises those elements necessary for 
production of infectious recombinant viral vector, but which lack the recombinant 

30 viral vector. Typically, such packaging cells contain one or more expression cassettes 
that are capable of expressing proteins necessary for the replication and packaging of 
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an introduced vector, for example, in the case of a lentiviral vector expression 
cassettes which encode Gag, pol and env proteins, in the case of an alphaviral vector, 
expression cassettes that encode alphaviral structural proteins. 

"Producer cell" or "vector producing cell" refers to a cell which contains all 
5 elements necessary for production of recombinant viral vector particles. 

Transfer of a "suicide gene" (e.g., a drug-susceptibility gene) to a target cell 
renders the cell sensitive to compounds or compositions that are relatively nontoxic to 
normal cells. Moolten, F.L. (1994) Cancer Gene Ther. 1:279-287. Examples of 
suicide genes are thymidine kinase of herpes simplex virus (HSV-tk), cytochrome 

10 P450 (Manome et al. (1996) Gene Therapy 3:513-520), human deoxycytidine kinase 
(Manome et al. (1996) Nature Medicine 2(5):567-573) and the bacterial enzyme 
cytosine deaminase (Dong et al. (1996) Human Gene Therapy 7:713-720). Cells 
which express these genes are rendered sensitive to the effects of the relatively 
nontoxic prodrugs ganciclovir (HSV-tk), cyclophosphamide (cytochrome P450 2B1), 

1 5 cytosine arabinoside (human deoxycytidine kinase) or 5-fluorocytosine (bacterial 

cytosine deaminase). Culver et al. (1992) Science 256:1550-1552, Huber et al. (1994) 
Proc. Natl. Acad. Sci. USA 91 :8302-8306. 

A "selectable marker" or "reporter marker" refers to a nucleotide sequence 
included in a gene transfer vector that has no therapeutic activity, but rather is 

20 included to allow for simpler preparation, manufacturing, characterization or testing of 
the gene transfer vector. 

A "specific binding agent" refers to a member of a specific binding pair of 
molecules wherein one of the molecules specifically binds to the second molecule 
through chemical and/or physical means. One example of a specific binding agent is 

25 an antibody directed against a selected antigen. 

By "subject" is meant any member of the subphylum chordata, including, 
without limitation, humans and other primates, including non-himian primates such as 
baboons, rhesus macaque, chimpanzees and other apes and monkey species; farm 
animals such as cattle, sheep, pigs, goats and horses; domestic mammals such as dogs 

30 and cats; laboratory animals including rodents such as mice, rats, rabbits, and guinea 
pigs; birds, including domestic, wild and game birds such as chickens, turkeys and 
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other gallinaceous birds, ducks, geese, and the like. The term does not denote a 
particular age. Thus, both adult and newborn individuals are intended to be covered. 
The system described above is intended for use in any of the above vertebrate species, 
since the immune systems of all of these vertebrates operate similarly. 
5 By "subtype" is meant a phylogenetic classification of similar organisms into 

groups based on similarities at the genetic (i.e., nucleic acid sequence) level. Such 
groups are designated "subtypes." In the HIV field, a well known and widely 
accepted centralized organization for the determination of such similarities and 
classification of particular viral isolates into subtypes is the Los Alamos National 

10 Laboratory. The HIV subtypes referred to herein are those as determined by the Los 
Alamos National Laboratory. (See, e.g., Myers, et al., Los Alamos Database, Los 
Alamos National Laboratory, Los Alamos, New Mexico; Myers, et al., Human 
Retroviruses and Aids, 1990, Los Alamos, New Mexico: Los Alamos National 
Laboratory.) A subtype can also be referred to as a "clade." 

1 5 By "serotype" is meant a classification of similar organisms based on antibody 

cross-reactivity. 

By "strain" is intended an organism fi-om within the subtype but which is 
differentiated fi-om other members of the same subtype based on differences in nucleic 
acid sequence. 

20 By "pharmaceutically acceptable" or "pharmacologically acceptable" is meant 

a material which is not biologically or otherwise undesirable, i.e., the material may be 
administered to an individual in a formulation or composition without causing any 
undesirable biological effects or interacting in a deleterious manner with any of the 
components of the composition in which it is contained. 

25 By "physiological pH" or a "pH in the physiological range" is meant a pH in 

the range of approximately 7.0 to 8.0 inclusive, more typically in the range of 
approximately 7.2 to 7.6 inclusive. 

As used herein, "treatment" refers to any of (i) the prevention of infection or 
reinfection, as in a traditional vaccine, (ii) the reduction or elimination of symptoms, 

30 or (iii) the substantial or complete elimination of the pathogen in question. Treatment 
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may be effected prophylactically (prior to infection) or therapeutically (following 
infection). 

By "co-administration" is meant administration of more than one composition, 
component of a composition, or molecule. Thus, co-administration includes 
concurrent administration or sequentially administration (in any order), via the same 
or different routes of administration. Non-limiting examples of co-administration 
regimes include, co-administration of nucleic acid and polypeptide; co-administration 
of different nucleic acids (e.g., different expression cassettes as described herein 
and/or different gene delivery vectors); and co-administration of different polypeptides 
(e.g., different HIV polypeptides and/or different adjuvants). The term also 
encompasses multiple administrations of one of the co-administered molecules or 
compositions (e.g., multiple administrations of one or more of the expression cassettes 
described herein followed by one or more administrations of a polypeptide-containing 
composition), hi cases where the molecules or compositions are delivered 
sequentially, the time between each administration can be readily determined by one 
of skill in the art in view of the teachings herein. 

"T lymphocytes" or "T cells" are non-antibody producing lymphocytes that 
constitute a part of the cell-mediated arm of the immune system. T cells arise from 
immature lymphocytes that migrate from the bone marrow to the thymus, where they 
undergo a maturation process under the direction of thymic hormones. Here, the 
mature lymphocytes rapidly divide increasing to very large nimibers. The maturing T 
cells become immunocompetent based on their ability to recognize and bind a specific 
antigen. Activation of immunocompetent T cells is triggered when an antigen binds to 
the lymphocyte's surface receptors. 

2.0.0 Modes of Carrying Out the Invention 

Before describing the present invention in detail, it is to be understood that this 
invention is not limited to particular formulations or process parameters as such may, 
of course, vary. It is also to be understood that the terminology used herein is for the 
purpose of describing particular embodiments of the invention only, and is not 
intended to be limiting. 
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Although a number of methods and materials similar or equivalent to those 
described herein can be used in the practice of the present invention, the preferred 
materials and methods are described herein. 



5 2.1.0 General Overview of the Invention 

The present invention relates to combination approaches to generate immune 
responses in subjects using compositions comprising immunogenic polynucleotides 
and polypeptides. 

In one general aspect of the present invention, a polynucleotide component of 

1 0 the present invention consists essentially of one polynucleotide encoding a 

immunogenic polypeptide derived from a microorganism (e.g., virus, bacteria, fungi, 
etc.), and a polypeptide component that comprises one or more immunogenic 
polypeptides analogous to the polypeptide encoded by said polynucleotide component, 
with the proviso that at least one immunogenic polypeptide of the polypeptide 

1 5 component is derived from a different subtype, serotype, or strain of the 

microorganism than the immunogenic polypeptide encoded by the polynucleotide 
component. In this context, the polynucleotide component consisting essentially of 
one polynucleotide encoding an immunogenic polypeptide refers to the presence of 
one polynucleotide encoding one immunogenic polypeptide in the composition. The 

20 polynucleotide composition may comprise further components, such as immune 

enhancers, immunoregulatory components, vector sequences (e.g., viral or non-viral), 
carriers, particles, excipients, expression control sequences, etc. In addition, the 
polynucleotide component may include further components such as molecules to 
enhance the immune response (e.g., liposomes, PLG, particles, alum, etc.). Further, 

25 the polypeptide component may comprise further components, such as, immune 
enhancers, imiriunoregulatory components, adjuvants, carriers, particles, excipients, 
etc. 

In a second general aspect of the present invention, there is provided a 
composition comprising a polynucleotide component comprising two or more 
30 polynucleotide sequences comprising coding sequences for two or more analogous 
immunogenic polypeptides derived from a microorganism (e.g., virus, bacteria, fungi, 
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etc.), wherein the coding sequences for at least two of the immunogenic polypeptides 
are derived from different subtypes, serotypes, or strains of the microorganism, and a 
polypeptide component comprising one or more immunogenic polypeptides analogous 
to the polypeptides encoded by said polynucleotide component. In some embodiments 
5 of this aspect of the invention, there is included the proviso that, if the polypeptide 
component provides the same number or more than the number of analogous 
immunogenic polypeptides encoded by the polynucleotide component, then the 
inmiunogenic polypeptides of the polypeptide composition are derived from at least 
one different subtype, serotype, or strain than the inmiunogenic polypeptides provided 

10 by the polynucleotide component. The polynucleotide composition may comprise 
further components, such as immune enhancers, immunoregulatory components, 
vector sequences (e.g., viral or non-viral), carriers, particles, excipients, expression 
control sequences, etc. In addition, the polynucleotide component may include fiirther 
components such as molecules to enhance the immune response (e.g., liposomes, 

1 5 PLG, particles, alum, etc.). Further, the polypeptide component may comprise ftirther 
components, such as, immune enhancers, immunoregulatory components, adjuvants, 
carriers, particles, excipients, etc. 

The invention is exemplified herein with reference to Human 
Immunodeficiency Virus 1 (HIV-1). One of ordinary skill in the art, in view of the 

20 teachings of the present specification, can apply the teachings of the present invention 
to other suitable organisms, for example, microorganisms. The compositions and 
methods of the present invention may, for example, employ polynucleotides encoding 
HIV envelope polypeptides and well as HIV envelope polypeptides, e.g., HIV 
envelope proteins analogous to those encoded by the polynucleotides, to induce broad 

25 and/or potent neutralizing activity against diverse HIV strains. Although described 
with reference to the HIV virus, the compositions and methods of the present 
invention can be applied to other virus families having a variety of subtypes, 
serotypes, and/or strain variations, for example, including but not limited to other non- 
HIV retroviruses (e.g. HTLV-1, 2), hepadnoviruses (e.g. HBV), herpesviruses (e.g. 

30 HSV-1 , 2, CMV, EBV, varizella-zoster, etc.), flavivimses (e.g. HCV, Yellow fever, 
Tick borne encephalitis, St. Louis Encephalitis, West Nile Virus, etc.), coronaviruses 
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(e.g. SARS), paramyxoviruses (e.g., PIV, RSV, measles etc.), influenza viruses, 
picomaviruses, reoviruses (e.g., rotavirus), arenaviruses, rhabdoviruses, 
papovaviruses, parvoviruses, adenoviruses, Dengue virus, bunyaviruses (e.g. , 
hantavirus), calciviruses (e.g. Norwalk virus), filoviruses (e.g. , Ebola, Marburg). 
5 The diversity and mutability of the HIV virus present challenges to HIV 

vaccine development. HIV continues to spread globally, with upwards of 42 million 
people infected with EW (UNAIDS Report on the global HP//AIDS epidemic, 
UNAIDS, Geneva, Switzerland (December 2002). These people are infected with 
different HIV subtypes (and/or strains). The infecting HIV subtype (and/or strain) is 

10 typically geographically dependent. In one aspect, the present invention relates to 
compositions and methods that provide the ability to induce broad and potent 
neutralizing antibodies against the diverse HIV subtypes, serotypes, and/or strains for 
the treatment of infections, reduction of infection risk, reduction of transmission, 
reduction of disease manifestations, and/or prevention of HIV infections arising in 

15 different regions. 

Experiments performed in support of the present invention confirm the use of 
the combination approaches described herein to induce potent and broad HIV- 
neutralization activity. The approaches include immunization with a variety of 
polynucleotides encoding HIV polypeptides derived from different subtypes, 

20 serotypes, or strains combined with immunization using HIV polypeptides derived 
from different subtypes, serotypes, or strains. The invention further hicludes 
immunization using various doses and immxmization regimens of such polynucleotides 
and polypeptides. 

Accordingly, ui a first particular aspect of the present invention, the 

25 polynucleotide component of the present invention consists essentially of one 
polynucleotide encoding an HIV immunogenic polypeptide, and the polypeptide 
component comprises of one or more HIV immunogenic polypeptides analogous to 
the polypeptide encoded by said polynucleotide component, with the proviso that at 
least one HIV immunogenic polypeptide of the polypeptide component is derived 

30 from a different HIV subtype, serotype, or strain than the immvmogenic polypeptide 
encoded by the polynucleotide component. In this context, consists essentially of one 



33 



PP20912.001 
PATENT 

polynucleotide refers to the presence of one polynucleotide sequence encoding one 
HIV immunogenic polypeptide in the polynucleotide composition. The 
polynucleotide composition may comprise further components, such as immune 
enhancers, immunoregulatory components, vector sequences (e.g., viral or non- viral), 
5 carriers, particles, excipients, expression control sequences, etc. In one embodiment 
of the present invention, the HIV immunogenic polypeptide encoded by the 
polynucleotide component is derived from subtype B, and at least one coding 
sequence of an HIV immunogenic polypeptide of the polypeptide component is 
derived from an HIV subtype other than subtype B, for example, subtype C, subtype 

10 A, subtype D, subtype E, subtype F, subtype G, subtype H, subtype I, subtype J, 
subtype K, subtype N or subtype O. In another embodiment, the HIV immunogenic 
polypeptide encoded by the polynucleotide component is derived from a first strain of 
a first subtype (e.g., a first Subtype B strain), and at least one coding sequence of an 
HIV immunogenic polypeptide of the polypeptide component is derived from a second 

1 5 strain of the first subtype (e.g., a second Subtype B strain). 

In one embodiment, a polynucleotide and a polypeptide from different HFV 
subtypes, serotypes, or strains are used for priming and boosting, i.e., a polynucleotide 
encoding an immunogenic HIV polypeptide is used for immunization via delivery of 
the polynucleotide (e.g., a prime), an analogous immunogenic HIV polypeptide 

20 derived from a different HTV subtype, serotype, or strain is used for immunization 
(e.g., a boost). For example, a polynucleotide is used for nucleic acid immunization, 
wherein the polynucleotide encodes an HIV gpl40 envelope polypeptide (i) derived 
from a South African HIV Subtype C isolate/sfrain, (ii) that is codon optimized for 
expression in mammalian cells, and (iii) is mutated by deletion of the V2 loop (e.g., 

25 gp 1 40mod.TVl .delV2, as described for example in PCT International Publication No. 
WO/02/04493). This nucleic acid immunization is followed by a protein boost using 
an HIV gpl40 envelope polypeptide (i) derived from a North American HIV Subtype 
B isolate/strain, and (ii) is mutated by deletion of the V2 loop (e.g., the protein product 
of gpl40.mut7.modSF162.deIV2, as described for example in PCT International 

30 Publication No. WO/00/39302). Oligomeric forms of the envelope polypeptide may 
be used (e.g., o-gpl40 as described in PCT International Publication No. 
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WO/00/39302 and US Patent No. 6,602,705). One embodiment of this aspect of the 
present invention, comprises a composition for generating an immune response in a 
mammal, the composition comprising: a polynucleotide component, comprising, a 
first polynucleotide encoding a first HIV immunogenic polypeptide; and a polypeptide 
5 component, comprising a second HIV immunogenic polypeptide, wherein said first 
and second immunogenic HIV polypeptide are derived fi-om different HIV subtypes, 
serotypes, or strains, and (ii) said first and second immunogenic polypeptides encode 
analogous HIV polypeptides. In one embodiment of the present invention, the 
analogous HIV immunogenic polypeptides coding sequences that comprise the 

10 polynucleotide composition and the HIV inununogenic polypeptides that comprise 
the polypeptide component of the present invention maybe derived fixDm different 
subtypes of HIV, in another embodiment they may derived from different strains of 
HIV fi-om the same HIV subtype. In another embodiment of this aspect of the present 
invention the polynucleotide and polypeptide components of the present invention are 

1 5 used to broadly raise neutralizing antibodies against viral strains that use the CCR5 
coreceptor for cell entry. For example, a composition for generating neutralizing 
antibodies in a mammal may comprise, a polynucleotide component consisting 
essentially of one polynucleotide encoding an HIV immunogenic polypeptide derived 
from an HIV strain that uses the CCR5 coreceptor for cell entry, and a polypeptide 

20 component comprising one or more HIV immunogenic polypeptides derived fi-om an 
HIV strain that uses the CCR5 coreceptor for cell entry analogous to the polypeptide 
encoded by said polynucleotide component, with the proviso that (i) if the polypeptide 
component has only one HIV immunogenic polypeptide, then the coding sequence of 
the HIV immunogenic polypeptide of the polypeptide component is derived firom a 

25 different HIV sfi-ain that uses the CCR5 coreceptor for cell entiy than the coding 

sequence of the immunogenic polypeptide encoded by the polynucleotide component, 
or (ii) if the polypeptide component comprises greater than one HIV immunogenic 
polypeptide, then the coding sequences of the polypeptides of the polypeptide 
component are derived from more than one HIV sti-ain that uses the CCR5 coreceptor 

30 for cell entry. 
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In second particular aspect of the present invention, the polynucleotide 
component comprises two or more polynucleotide sequences comprising coding 
sequences for two or more analogous HIV immunogenic polypeptides, wherein the 
coding sequences for at least two of the HIV immunogenic polypeptides are derived 
5 from different HIV subtypes, serotypes, or strains, and a polypeptide component 

comprising one or more HIV immunogenic polypeptides analogous to the polypeptide 
encoded by said polynucleotide component, with the proviso that, if the polypeptide 
component provides the same number or greater than the nimiber of analogous HIV 
immunogenic polypeptides encoded by the polynucleotide component, then at least 

1 0 one of the HIV immunogenic polypeptides of the polypeptide composition is derived 
from a different HIV subtype, serotype, or strain than the HIV immunogenic 
polypeptides provided by the polynucleotide component. 

In one embodiment of the present invention, two or more polynucleotides 
encoding immunogenic HIV polypeptides, derived from at least two different 

15 subtypes, serotypes, or strains are mixed (e.g., in equal amounts) for priming. Then a 
single, analogous, immunogenic HIV polypeptide derived from one of the subtypes, 
serotypes, or strains used for priming is used for boosting. A more general 
embodiment comprises a composition for generating an immxme response in a 
mammal, said composition comprising: a polynucleotide component, comprising, two 

20 or more polynucleotides each encoding analogous HIV immunogenic polypeptides, 
with the proviso that the coding sequences of each HIV immunogenic polypeptide are 
derived from different HIV subtypes, serotypes, or strains; and a polypeptide 
component, comprising one or more HIV immunogenic polypeptides, with the proviso 
that said polypeptide component comprises at least one less HIV inimunogenic 

25 polypeptide than encoded by said polynucleotide component. For example, two DNA 
molecules are used for nucleic acid immunization, wherein the first DNA molecule 
encodes an HIV gpl40 envelope polypeptide (i) derived from a South African HIV 
Subtype C isolate/strain, (ii) that is codon optimized for expression in mammalian 
cells, and (iii) is mutated by deletion of the V2 loop (e.g., gpl40mod.TVl .delV2, as 

30 described for example in PCT International Publication No. WO/02/04493), and the 
second DNA molecule encodes an HIV gpl40 envelope polypeptide (i) derived from a 
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North American HIV Subtype B isolate, (ii) that is codon optimized for expression in 
mammalian cells, and (iii) is mutated by deletion of the V2 loop (e.g., 
gpl40.modSF162.delV2, as described for example in PCT International Publication 
No. WO/00/39302). This DNA immunization is followed by a protein boost using a 
5 single HIV gpl40 envelope polypeptide (i) derived from a North American HIV 
Subtype B isolate, and (ii) is mutated by deletion of the V2 loop (e.g., the protein 
product of gpl40.mut7.modSF162.delV2, as described for example in PCT 
International Publication No. WO/00/39302). Oligomeric forms of the envelope 
polypeptide maybe used (e.g., o-gpl40 as described in PCT International Pubhcation 

10 No. WO/00/39302). One embodiment of a composition for generating an inmiune 
response in a mammal comprises, a polynucleotide component comprising a first 
polynucleotide encoding a first immunogenic HIV polypeptide, and a second 
polynucleotide encoding a second immunogenic HIV polypeptide, wherein (i) said 
first and second immunogenic HIV polypeptide are derived fi-om different HIV 

1 5 subtypes, serotypes, or strains, and (ii) said first and second immunogenic 

polypeptides encode analogous HIV polypeptides, and a polypeptide component 
comprising said first HIV immunogenic polypeptide, or said second HIV 
immunogenic polypeptide, with the proviso that said polypeptide component 
comprises at least one less HIV immunogenic polypeptide than is encoded by said 

20 polynucleotide component. In a preferred embodiment, polynucleotides encoding 
analogous immunogenic HIV polypeptides, derived from a variety of different HIV 
subtypes, serotypes, or strains are used for a prime immunization, and a single 
analogous immunogeruc HIV polypeptide is used for one or more protein boost. 

In another embodiment, two or more polynucleotides encoding immunogenic 

25 HIV polypeptides, derived from at least two different subtypes, serotypes, or strains 
are mixed (e.g., in equal amounts) for priming. Then one or more analogous, 
immunogenic HIV polypeptides derived from at least two different subtypes, 
serotypes, or strains are used for boosting, wherein at least one of the immunogenic 
HIV polypeptides is derived from a subtype, serotype, or strain not represented in the 

30 polynucleotide component. For example, the polynucleotide component comprises 
three polynucleotides encoding three immunogenic HFV polypeptides, one coding 
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sequence derived from a subtype B strain, one coding sequence derived from a 
subtype C strain, and one coding sequence derived from a subtype E strain, and the 
polypeptide component comprises three immunogenic HIV polypeptides, one coding 
sequence derived from a subtype B strain, one coding sequence derived from a 
5 subtype C strain, and one coding sequence derived from a subtype 0 strain. In another 
embodiment of this aspect of the present invention, the polynucleotides of the 
polynucleotide component comprises polynucleotides encoding analogous HIV 
immunogenic polypeptides from different subtypes, serotypes, or strains as the 
polypeptides of the polypeptide component. For example, DNA immunization with 

1 0 two or more DNA molecules encoding HIV gp 1 40 polypeptides (wherein the two or 
more gpl40 coding sequences are derived from two or more HIV-1 subtypes, 
serotypes, or sfrains). The polypeptide component, use for protein immunization, 
comprises two or more gpl40 polypeptides (wherein the two or more gpl40 coding 
sequences are derived from two or more HIV-1 subtypes, serotypes, or strains, with 

15 the proviso that at least one of the polypeptide sequences is derived from an HIV- 1 
subtype, serotype, or strain not represented in the DNA component). 

In another embodiment, the polynucleotide component comprises two or more 
polynucleotide sequences comprising coding sequences for two or more analogous 
HIV immunogenic polypeptides, wherein the coding sequences for at least two of the 

20 HTV immunogenic polypeptides are derived from different HIV strains that use the 
CCR5 coreceptor for cell entry, and the polypeptide component comprises one or 
more HIV immunogenic polypeptides analogous to the polypeptide encoded by said 
polynucleotide component, with the proviso that (i) if the polypeptide component 
provides less than the number of analogous HIV immunogenic polypeptides encoded 

25 by the polynucleotide component, then the HTV immunogenic polypeptides of the 
polypeptide composition may be derived from the same and/or different HIV strains 
that use the CCR5 coreceptor for cell entry as the HIV immunogenic polypeptides 
provided by the polynucleotide component, or (ii) if the polypeptide component 
provides the same or greater than the number of analogous HIV immunogenic 

30 polypeptides encoded by the polynucleotide component, then at least one of the HIV 
immunogenic polypeptides of the polypeptide composition is derived from a different 
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HIV strain that uses the CCR5 coreceptor for cell entry than the HIV immunogenic 
polypeptides provided by the polynucleotide component. 

In a further aspect, the present invention relates to the use of varied doses of 
polynucleotides and polypeptides in prime^oost methods, particularly the methods 
5 described herein. In any immunization method using, for example, a mixed 

polynucleotide prime (i.e., two or more polynucleotides encoding immunogenic HfV 
polypeptides derived from two or more HIV subtypes, serotypes, or strains) in 
conjunction with a polypeptide boost the present invention includes using reduced 
doses of each single component to provide an equivalent immune response to using 

10 full doses of each component. In one embodiment, the high threshold of DNA is the 
maximum tolerable dose of DNA (e.g., about 5 mg to about 10 mg total DNA), the 
low threshold of DNA is the minimum effective dose (e.g., about 2 ug to about 10 ug 
total DNA), the high threshold of protein is the maximum tolerable dose of protein 
(e.g., about 1 mg total protein), the low threshold of protein is the minimum effective 

1 5 dose (e.g., about 2 ug total protein). Experiments performed in support of the present 
invention demonstrated that the total DNA dose may be divided among the 
polynucleotides of the polynucleotide component (for example, four polynucleotide 
constructs used, the total DNA for all four is less than or equal to the high threshold) 
(e.g., Example 4). Further, the total polypeptide dose may be divided among the 

20 polypeptides comprising the polypeptide component (for example, four polypeptides 
used, the total protein for all four is less than or equal to the high threshold) (e.g., 
Example 4). The total DNA and total protein are both typically above the low 
threshold values. 

In a preferred embodiment, the total amount of DNA in a given DNA 
25 immunization has a high threshold of less than or equal to about 1 0 mg total DNA and 
greater than or equal to 1 mg total DNA, and the total amount of protein in a given 
polypeptide boost has a high threshold of less than or equal to about 200 ug total 
protein product and greater than or equal to 10 ug of total protein. For example, in an 
embodiment using a polynucleotide component having two DNA molecules each 
30 encoding an immunogenic HIV polypeptide the dose of each DNA molecule per 

subject may be one milligram of each DNA molecule encoding an immunogenic HIV 
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polypeptide, for a total of 2 mg for the two DNA molecules, or 0.5 mg of each DNA 
molecule encoding an immunogenic HIV polypeptide, for a total of 1 mg for the two 
DNA molecules. Dosing with the polypeptide component may be similarly varied, for 
example, using a polypeptide component having two immunogenic HIV polypeptides 
5 the dose of each polypeptide per subject may be 1 00 micrograms of each 

immunogenic HIV polypeptide, for a total of 200 ug for the two polypeptides, 50 
micrograms of each immunogenic HIV polypeptide, for a total of 100 ug for the two 
polypeptides, or 25 ug of each immunogenic HIV polypeptide, for a total of 50 ug for 
the two polypeptides. As described above, more than two polypeptides may be 

1 0 included in the polypeptide component of the present invention. 

In one embodiment of this aspect of the present invention, the polynucleotides 
of the polynucleotide component encode analogous HIV immunogenic polypeptides 
from the same subtypes, serotypes, or strains as the polypeptides of the polypeptide 
component. For example, two DNA molecules are used for nucleic acid 

1 5 immunization, wherein the first DNA molecule encodes an HIV gpl40 envelope 

polypeptide (i) derived from a South African HIV Subtype C isolate, (ii) that is codon 
optimized for expression in mammalian cells, (iii) is mutated by deletion of the V2 
loop (e.g., gpl40mod.TVl.delV2, as described for example in PCT Intemational 
Publication No. WO/02/04493), and (iv) is delivered at 0.5 mg, and the second DNA 

20 molecule encodes an HIV gpI40 envelope polypeptide (i) derived from a North 
American HIV Subtype B isolate, (ii) that is codon optimized for expression in 
mammalian cells, (iii) is mutated by deletion of the V2 loop (e.g., 
gpl40.modSF162.delV2, as described for example in PCT Intemational Publication 
No. WO/00/39302), and (iv) is delivered at 0.5 mg. This DNA immunization is 

25 followed by a protein boost using an HIV gpl 40 envelope polypeptide (i) derived 

from a South African HIV Subtype C isolate, (ii) is mutated by deletion of the V2 loop 
(e.g., the protein product of gpl40mod.TVl.mut7. del V2, as described for example in 
PCT Intemational PubUcation No. WO/02/04493), and (iii) is delivered at 50 ug 
protein, and an HIV gpl 40 envelope polypeptide (i) derived from a North American 

30 HIV Subtype B isolate, (ii) is mutated by deletion of the V2 loop (e.g., the protein 
product of gpl40.mut7.modSF162.delV2, as described for example in PCT 
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International Publication No. WO/00/39302), and (iii) is delivered at 50 ug protein. 
Further, oligomeric forms of the envelope polypeptide may be used (e.g., o-gpl40 as 
described in PCT International Publication No. WO/00/39302). 

In further embodiments, the polynucleotide component of the present invention 
5 may comprise one or more gene delivery vectors comprising the polynucleotide(s) 
encoding immunogenic HIV polypeptide(s). Further components that may be 
included in the polynucleotide component are described herein. The polypeptide 
component of the present invention may comprise an adjuvant in addition to the 
immunogenic polypeptide(s). Further components that may be included in the 

10 polypeptide component are described herein. 

The present invention also comprises methods for generating an immune 
response in a subject. In one general aspect, the method comprises administering to a 
subject a first component providing an immunogenic polypeptide and administering to 
a subject a second component providing a different but analogous immunogenic 

1 5 polypeptide. The first component and the second component may be polynucleotide 
components or polypeptide components. The immunogenic polypeptides may be 
provided either directly (as in a polypeptide component) or indirectly (as in a 
polynucleotide component). In a preferred embodiment, one of the components 
(either first or second component) is a polynucleotide component, and the other 

20 component (either second or first component) is a polypeptide component. Preferably, 
the polypeptide inununogens provided by the first and second components are 
analogous HIV immunogenic polypeptides. The first and second components may be 
administered simultaneously or may be administered at separate times. Preferably, the 
first and second components are administered in a prime-boost regimen. Various 

25 prime-boost regimens have been described in the art and are well known to those of 
ordinary skill. In a typical prime-boost regimen, a first component providing a 
polypeptide immunogen is administered to a subject; the initial immime response is 
followed by determining the production of binding antibodies to the polypeptide 
immunogen in said subject until the titer of binding antibodies begins to decline; and a 

30 second component providing a different but related polypeptide inmiunogen is 
administered to the subject. 
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The first and second components may be provided as a composition. In a 
particular aspect the method comprises, providing a composition of the present 
invention for generating an immime response in a mammal, administering one or more 
gene delivery vectors comprising the polynucleotides of the polynucleotide component 
5 of the composition into the subject under conditions that are compatible with 

expression of the polynucleotides in the subject for the production of encoded HIV 
immunogenic polypeptides, and administering the polypeptide component to the 
subject. The administering of the polynucleotide and polypeptide compositions may be 
concurrent or sequentially. In a preferred embodiment immunization with the 

1 0 polynucleotide component precedes immunization with the polypeptide component. 
Further, a single prime may be followed by multiple boosts, multiple primes may be 
followed by a single boost, multiple primes may be followed by multiple boosts, or a 
series of primes and boosts may be used. The polynucleotide component may 
comprise further components (e.g., components for enhancing immune response, 

1 5 carriers, etc.). The polypeptide component may comprise further components (e.g., 
components for enhancing immune response, carriers, etc.). 

Exemplary polynucleotide constructs, methods of making the polynucleotide 
constructs, corresponding polypeptide products, and methods of making polypeptides 
useful for HIV immunization have been previously described, for example, in the 

20 following PCT Intemational Publication Nos,: WO/00/39302; WO/00/39303; 

WO/00/39304; WO/02/04493; WO/03/004657; WO/03/004620; and WO/03/020876. 

Although described generally with reference to HIV subtypes B and C as 
exemplary subtypes, the compositions and methods of the present invention are 
applicable to a wide variety of HIV subtypes, serotypes, or strains and immunogenic 

25 polypeptides encoded thereby, including but not limited to the previously identified 
HIV-1 subtypes A through K, N and 0, the identified CRFs (circulating recombinant 
forms), and HIV-2 strains and its subtypes. See, e.g., Myers, et al., Los Alamos 
Database, Los Alamos National Laboratory, Los Alamos, New Mexico; Myers, et al., 
Human Retroviruses and Aids, 1990, Los Alamos, New Mexico: Los Alamos National 

30 Laboratory. Further, the compositions and methods of the present invention may be 
used to raise broadly reactive neutralizing antibodies against viral strains and subtypes 
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that use the CCR5 coreceptor for cell entry (for example, both TVl and SF162 use the 
CCR5 coreceptor (Example 4)). 

The polypeptide component of the present invention may comprise fragments 
of immunogenic polypeptide, for example, wherein the polypeptide sequence or a 
5 portion thereof contains an amino acid sequence of at least 3 to 5 amino acids, more 
preferably at least 8 to 10 amino acids, and even more preferably at least 15 to 20 
amino acids from a polypeptide encoded by the nucleic acid sequence. Also 
encompassed are polypeptide sequences that are immunologically identifiable with a 
polypeptide encoded by the sequence. Further, polyproteins can be constructed by 
1 0 fusing in-frame two or more polynucleotide sequences encoding polypeptide or 
peptide products. 

In addition, the polynucleotide component of the present invention may 
comprise one or more monocistronic expression cassettes comprising polynucleotides 
encoding immunogenic HFV polypeptides, or one or more polycistronic expression 

1 5 cassettes comprising polynucleotides encoding immunogenic HIV polypeptides, or 
combinations thereof Polycistronic coding sequences may be produced, for example, 
by placing two or more polynucleotide sequences encoding polypeptide products 
adjacent each other, typically under the control of one promoter, wherein each 
polypeptide coding sequence may be modified to include sequences for internal 

20 ribosome binding sites. 

A variety of combinations of polynucleotides encoding immunogenic 
polypeptides (e.g., HIV immunogenic polypeptides) and immunogenic polypeptides or 
fiiagments thereof (e.g., HIV immunogenic polypeptides) can be used in the practice of 
the present invention. Polynucleotide sequences encoding immunogenic polypeptides 

25 can be included in a polynucleotide component of compositions of the present 

invention, for example, as DNA immunization constructs containing, for example, a 
synthetic Env expression cassettes, a synthetic Gag expression cassette, a synthetic 
pol-derived polypeptide expression cassette, a synthetic expression cassette 
comprising sequences encoding one or more accessory or regulatory genes (e.g., tat, 

30 rev, nef, vif, vpu, vpr). Immunogenic polypeptides may be included as purified 

polypeptides in the polypeptide component of compositions of the present invention. 
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The immunogenic polypeptides may be synthetic or wild-type. In preferred 
embodiments the immimogenic polypeptides are antigenic viral proteins, or fragments 
thereof. 

5 2.2.0 Identification OF Analogous Polypeptides AND Polynucleotides 
Encoding Such Polypeptides 

The compositions and methods of the present invention are described with 
reference to exemplary HIV-l sequences. The present invention is not limited to the 
sequences described herein. Numerous sequences for use in the practice of the present 

1 0 invention have been previously described (see, e.g., PCT International Publication 
Nos. WO/00/39302; WO/00/39303; WO/00/39304; WO/02/04493; WO/03/004657; 
WO/03/004620; and WO/03/020876.). Typically, the polynucleotide sequences used 
in the practice of the present invention encode polypeptides derived from a viral 
source (e.g., HIV-l). The polypeptides are typically derived from antigenic viral 

1 5 proteins, in particular, group specific antigen polypeptides, envelope polypeptides, 
capsid polypeptides, and other structural and non-structural polypeptides. The present 
invention is particularly described with reference to the use of envelope polypeptides 
and modifications thereof (and polynucleotides encoding same) derived from various 
subtypes, serotypes, or strains of the HIV-l virus. Other HFV-l polypeptides and 

20 polynucleotides encoding such polypeptides may be used in the practice of the present 
invention including, but not limited to. Gag, Pol (including Protease, Reverse 
Transcriptase, and Integrase), Tat, Rev, Nef, Vif, Vpr, and Vpu. 

The HIV genome and various polypeptide-encoding regions are shown in 
Table 1. The nucleotide positions are given relative to an HIV-l Subtype C isolate 

25 from South Africa strain 8_5_TV]_C.ZA (Figures 1 A-ID). However, it will be 
readily apparent to one of ordinary skill in the art in view of the teachings of the 
present disclosure how to determine corresponding regions in other HIV strains (from 
the same or different subtypes) or variants {e.g., isolates HlVmb, HrVsF2, HIV-1sfi62j 
HIV-lsFi7o, HIVlav, HIVlai, HIVmn, HIV-1cm235„ HIV-1us4, other HFV-I strains 

30 from diverse subtypes(e.g., subtypes, A through K, N and 0), the identified CRFs 
(circulating recombinant forms), HIV-2 strains and diverse subtypes and strains (e.g., 
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HIV-2uci and HrV-2uc2), and simian immunodeficiency virus (SIV). (See, e.g., 
Virology, 3rd Edition (W.K. Jokiik ed. 1988); Fundamental Virology, 2nd Edition 
(B.N. Fields and D.M. Kiiipe, eds. 1991); Virology, 3rd Edition (Fields, BN, DM 
Knipe, PM Howley, Editors, 1996, Lippincott-Raven, Philadelphia, PA; for a 
description of these and other related viruses), using for example, sequence 
comparison programs {e.g., BLAST and others described herein) or identification and 
alignment of structural features (e.g., a program such as the "ALB" program described 
herein that can identify the various regions). 

Table 1 

Regions of the fflV Genome relative to the Sequence of 8_5_TV1_C.ZA 



Region 


Position in nucleotide sequence 


5'LTR 


1-636 


U3 


1-457 


R 


458-553 


U5 


554-636 


NFkBII 


340-348 


NFkBI 


354-362 


spi ni 


379-388 


Spin 


390-398 


Spl I 


400-410 


TATA Box 


429-433 


TAR 


474-499 


Poly A signal 


529-534 


PBS 


638-655 


p7 binding region, packaging signal 


685-791 


Gag: 


792-2285 


pl7 


792-1178 
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Region 


Position in nucleotide sequence 


p24 


1179-1871 


Cyclophilin A bdg. 


1395-1505 


MHR 


1632-1694 


p2 


1872-1907 


p7 


1908-2072 


Frameshift slip 


2072-2078 


pl 


2073-2120 


p6Gag 


2121-2285 


Zn-motif I 


1950-1991 


Zn-motifll 


2013-2054 


Pol: 


2072-5086 


p6Pol 


2072-2245 


Prot 


2246-2542 


p66RT 


2543-4210 


plSRNaseH 


3857-4210 


p31Int 


4211-5086 


Vif: 


5034-5612 


Hydrophilic region 


5292-5315 


Vpr: 


5552-5839 


Oligomerization 


5552-5677 


Amphipathic a-helix 


5597-5653 


Tat: 


5823-6038 and 8417-8509 


Tat-1 exon 


5823-6038 


Tat-2 exon 


8417-8509 


N-terminal domain 


5823-5885 


Trans-activation domain 


5886-5933 
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Region 


Position in nucleotide sequence 


Transduction domain 


5961-5993 


Rev: 


5962-6037 and 8416-8663 


Rev-1 exon 


5962-6037 


Rev-2 exon 


8416-8663 


High-affinity bdg. site 


8439-8486 


Leu-rich effector domain 


8562-8588 


Vpu: 


6060-6326 


Transmembrane domain 


6060-6161 


Cytoplasmic domain 


6162-6326 


Env (gpl60): 


6244-8853 


Signal peptide 


6244-6324 


gpl20 


6325-7794 


VI 


6628-6729 


V2 


6727-6852 


V3 


7150-7254 


V4 


7411-7506 


V5 


7663-7674 


CI 


6325-6627 


C2 


6853-7149 


C3 


7255-7410 


C4 


7507-7662 


C5 


7675-7794 


CD4 binding 


7540-7566 


gp41 


7795-8853 . 


Fusion peptide 


7789-7842 


Oligomerization domain 


7924-7959 


N-terminal heptad repeat 


7921-8028 
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Region 


Position in nucleotide sequence 


C-terminal heptad repeat 


8173-8280 


Immunodominant region 


8023-8076 


Nef: 


8855-9478 


Myristoylation 


8858-8875 


SH3 binding 


9062-9091 


Polypurine tract 


9128-9154 


SH3 binding 


9296-9307 



It will be readily apparent that one of skill in the art can align any HIV 
sequence to that shown in Table 1 to determine relative locations of any particular 
HIV gene. For example, using one of the alignment programs described herein {e.g., 
BLAST), other HIV genomic sequences can be aligned with 8_5_TV1_C.ZA (Table 
5 1) and locations of genes determined. Polypeptide sequences can be similarly aligned. 
For example, Figures 2A-2E shows the alignment of Env polypeptide sequences from 
various strains, relative to SF-162. As described in detail in PCT International 
Publication No. WO/00/39303, Env polypeptides (e.g., gpl20, gpl40 and gpl60) 
include a "bridging sheet" comprised of 4 anti-parallel beta-strands (beta-2, beta-3, 

10 beta -20 and beta -21) that form a beta -sheet. Extruding from one pair of the beta - 
strands (beta -2 and beta -3) are two loops, VI and V2. The beta -2 sheet occurs at 
approximately amino acid residue 113 (Cys) to amino acid residue 117 (Thr) while 
beta -3 occurs at approximately amino acid residue 192 (Ser) to amino acid residue 
194 (He), relative to SF-162. The "V1/V2 region" occurs at approximately amino acid 

15 positions 120 (Cys) to residue 189 (Cys), relative to SF-162. Extruding from the 
second pair of beta -strands (beta -20 and beta -21) is a "small-loop" structure, also 
referred to herein as "the bridging sheet small loop." The locations of both the small 
loop and bridging sheet small loop can be determined relative to HXB-2 following the 
teachings herein and in PCT International Publication No. WO/00/39303. Also 

20 shown by arrows in Figures 2A-2E are approximate sites for deletions sequence from 
the beta sheet region. The "*" denotes N-glycosylation sites that can be mutated 
following the teachings of the present specification. 
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2.3.0 . Expression Cassettes Comprising Polynucleotide Sequences, 
Vectors, Polypeptides, Further Components, and Formulations 
Useful in the Practice of the Present Invention 

5 Compositions for the generation of immune responses of the present invention 

comprise a polynucleotide component and a polypeptide component. The 
polynucleotide component of may comprise one or more polynucleotides encoding 
immunogenic viral polypeptides. Such polynucleotides may comprise native viral 
sequences encoding immunogenic viral polypeptides or synthetic polynucleotides 

10 encoding immunogenic polypeptides. Synthetic polynucleotides may include 

sequence optimization to provide improved expression of the encoded polypeptides 
relative to the analogous native polynucleotide sequences. Further, synthetic 
polynucleotides may comprise mutations (single or multiple point mutations, missense 
mutations, nonsense mutations, deletions, insertions, etc.) relative to corresponding 

15 wild-type sequences. 

The polypeptide component of the compositions of the present invention may 
comprise one or more immunogenic viral polypeptide. Such polypeptides may 
comprise native immunogenic viral polypeptides or modified immunogenic 
polypeptides. Modified polypeptides may include sequence optimization to provide 

20 improved expression of the polypeptides relative to the analogous native 

polynucleotide sequences. Further, modified polypeptides may comprise mutations 
(single or multiple point mutations, missense mutations, nonsense mutations, 
deletions, insertions, etc.) relative to corresponding wild-type sequences. 

The compositions of the present invention, comprising a polynucleotide 

25 component and a polypeptide component, are described with reference to HIV-1 

derived sequences. However, the compositions and methods of the present invention 
are applicable to other types of viruses as well, wherein such viruses comprise 
multiple subtypes, serotypes, and/or strain variations, for example, including but not 
Umited to other non-HIV retroviruses (e.g. HTLV-1, 2), hepadnoviruses (e.g. HBV), 

30 herpesviruses (e.g. HSV-1, 2, CMV, EBV, varizella-zoster, etc.), flaviviruses (e.g. 
HCV, Yellow fever, Tick borne encephalitis, St. Louis Encephalitis, West Nile Virus, 
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etc.), coronavimses (e.g. SARS), paramyxoviruses (e.g., PIV, RSV, measles etc.), 
influenza viruses, picomaviruses, reoviruses (e.g., rotavirus),' arenaviruses, 
rhabdovinises, papovaviruses, parvoviruses, adenoviruses, Dengue virus, bunyaviruses 
(e.g. , hantavirus), calciviruses (e.g. Norwalk virus), filoviruses (e.g. , Ebola, 
5 Marburg). 

2.3.1 Modification of Polynucleotide Coding Sequences 

HIV-1 coding sequences, and related sequences, may be modified to have 
improved expression in target cells relative to the corresponding wild-type sequences. 
1 0 Following here are some exemplary modifications that can be made to such coding 
sequences. 

First, the HIV-1 codon usage pattern may be modified so that the resulting 
nucleic acid coding sequence are comparable to codon usage found in highly 
expressed human genes. The HIV codon usage reflects a high content of the 

1 5 nucleotides A or T of the codon-triplet. The effect of the HIV-1 codon usage is a high 
AT content in the DNA sequence that results in a decreased translation ability and 
instability of the mRNA. In comparison, highly expressed hviman codons prefer the 
nucleotides G or C. The HIV coding sequences may be modified to be comparable to 
codon usage found in highly expressed human genes. 

20 Second, there are inhibitory (or instability) elements (INS) located within the 

coding sequences of, for example, the Gag coding sequences. The RRE is a secondary 
RNA structure that interacts with the HIV encoded Rev-protein to overcome the 
expression down-regulating effects of the INS. To overcome the post-transcriptional 
activating mechanisms of RRE and Rev, the instability elements can be inactivated by 

25 introducing multiple point mutations that do not alter the reading frame of the encoded 
proteins. 

Third, for some genes the coding sequence has been altered such that the 
polynucleotide coding sequence encodes a gene product that is inactive or non- 
functional (e.g., inactivated polymerase, protease, tat, rev, nef, vif, vpr, and/or vpu 
30 gene products). Example 1 describes some exemplary mutations. 
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The synthetic coding sequences are assembled by methods known in the art, 
for example by companies such as the Midland Certified Reagent Company (Midland, 
Texas), following the guidance of the present specification. 

Some exemplary synthetic polynucleotide sequences encoding immunogenic 
5 HIV polypeptides and the polypeptides encoded thereby for use in the methods of the 
present invention have been described, for example, in PCT International Publication 
Nos. WO/00/39303, WO/00/39302, WO 00/39304, WO/02/04493, WO/03/020876, 
WO/03/004620, and WO/03/004657. 

In a preferred embodiment, the present invention relates to polynucleotides 

10 encoding Env polypeptides and corresponding Env polypeptides. For example, the 
codon usage pattern for Env may be modified so that the resulting nucleic acid coding 
sequence is comparable to codon usage found in highly expressed human genes. Such 
synthetic Env sequences are capable of higher level of protein production relative to 
the native Env sequences (see, for example, PCT htemational Publication Nos. 

1 5 WO/00/39302). Modification of the Env polypeptide coding sequences results in 
improved expression relative to the wild-type coding sequences in a number of 
mammalian cell lines (as well as other types of cell lines, including, but not limited to, 
insect cells). Similar Env polypeptide coding sequences can be obtained, modified 
and tested for improved expressioii firom a variety of isolates. 

20 Further modifications of Env include, but are not limited to, generating 

polynucleotides that encode Env polypeptides having mutations and/or deletions 
therein. For instance, the hypervariable regions, VI and/or V2, can be deleted as 
described herein. In addition, the variable regions V3, V4 and/ or V5 can be modified 
or deleted. (See e.g, US 6,602,705) Additionally, other modifications, for example to 

25 the bridging sheet region and/or to N-glycosylation sites within Env can also be 

performed following the teachings of the present specification, (see, Figures 2A-2E, 
as well as PCT International Publication Nos. WO/00/39303, WO/00/39302, WO 
00/39304, WO/02/04493, WO/03/020876, and WO/03/004620). Other useful 
modifications of env are well known and include those described in Schulke et al, (J. 

30 Virol. 2002 76:7760), Yang et al. 2002, (J. Virol. 2002 76:4634), Yang et al. 2001( J. 
Virol. 2001 75:1 165), Shu et al. (Biochem. 1999 38:5378), Farzan et al. (J.Virol. 1998 
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11:1 eiQi) and Xiang et al. (J.Virol. 2002 76:9888). Various combinations of these 
modifications can be employed to generate synthetic expression cassettes and 
corresponding polypeptides as described herein. 

The present invention also includes expression cassettes which include 
5 synthetic sequences derived HIV genes other than Env, including but not limited to, 
regions within Gag, Env, Pol, as well as, tat, rev, nef, vif, vpr, and vpu. Further, the 
present invention includes synthetic polynucleotides and/or expression cassettes (as 
well as polypeptide encoded thereby) comprising two or more antigenic polypeptides. 
Such sequences may be used, for example, in their entirety or sequences encoding 

1 0 specific epitopes or antigens may be selected fi-om the synthetic coding sequences 
following the teachings of the present specification and information known in the art. 
For example, the polypeptide sequences encoded by the polynucleotides may be 
subjected to computer analysis to predict antigenic peptide fragments within the full- 
length sequences. The corresponding polynucleotide coding fragments may then be 

1 5 used in the constructs of the present invention. Exemplary algorithms useful for such 
analysis include, but are not limited to, the following: 

AMPHl. This program has been used to predict T-cell epitopes (Gao, et al., 
(1989) J. hnmunol. 143:3007; Roberts, et al, (1996) AIDS Res Hum Retrovir 12:593; 
Quakyi, et al., (1992) Scand J Immunol suppl. 11:9). The AMPHI algorithm is 

20 available int the Protean package of DNASTAR, Inc. (Madison, Wl, USA). 

ANTIGENIC INDEX. This algorithm is useful for predicting antigenic 
determinants (Jameson & Wolf, (1998) CABIOS 4:181:186; Sherman, KE, et al., 
Hepatology 1996 Apr;23(4):688-94; Kasturi, KN, et al, J Exp Med 1995 Mar 
1;1 81(3): 1027-36; van Kampen V, et al., Mol hnmunol 1994 Oct;31(15):l 133-40; 

25 Ferroni P, et al., J Clin Microbiol 1993 Jun;3 1(6): 1586-91 ; Beattie J, et al., Eur J 
Biochem 1992 Nov 15;210(l):59-66; Jones GL, et al, Mol Biochem Parasitol 1991 
Sep;48(l):l-9). 

HYDROPHILICITY. One algorithm useful for determining antigenic 
determinants firom amino acid sequences was disclosed by Hopp & Woods (1981) 
30 (PNAS USA 78:3824-3828. 
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Default parameters, for the above-recited algorithms, may be used to determine 
antigenic sites. Further, the results of two or more of the above analyses may be 
combined to identify particularly preferred fragments. 

5 2.3.2 Further Modification OF Polynucleotide Sequences And 
Polypeptides Encoded Thereby 

The immunogenic viral polypeptide-encoding expression cassettes described 
herein may also contain one or more fiirther sequences encoding, for example, one or 
more transgenes. In one embodiment of the present invention, the polynucleotide 

1 0 component may comprise coding sequences for one or more HIV immimogenic 
polypeptides. Further, the polypeptide component may comprise one or more HIV 
immunogenic polypeptide. In a different embodiment of the present invention, a 
polynucleotide component may comprise coding sequences for one or more HIV 
immunogenic polypeptides, wherein the polynucleotide component further comprises 

1 5 a sequence encoding an additional antigenic polypeptide, with the proviso that the 
additional antigenic polypeptide is not an immimogenic polypeptide derived from an 
HIV-1 strain. Further, the polypeptide component may comprise one or more HIV 
immunogenic polypeptides, wherein the polypeptide component further comprises an 
additional antigenic polypeptide, with the proviso that the additional antigenic 

20 polypeptide is not an immunogenic polypeptide derived from an HIV-1 strain. 

Further sequences (e.g., transgenes) useful in the practice of the present 
invention include, but are not limited to, further sequences are those encoding further 
viral epitopes/antigens {including but not limited to, HCV antigens (e.g.. El, E2; 
Houghton, M.., et al., U.S. Patent No. 5,714,596, issued February 3, 1998; Houghton, 

25 M.., et al., U.S. Patent No. 5,712,088, issued January 27, 1998; Houghton, M.., et at., 
U.S. Patent No. 5,683,864, issued November 4, 1997; Weiner, A.J., et al., U.S. Patent 
No. 5,728,520, issued March 17, 1998; Weiner, A.J., et al., U.S. Patent No. 5,766,845, 
issued June 16, 1998; Weiner, A.J., et al, U.S. Patent No. 5,670,152, issued 
September 23, 1997), HIV antigens (e.g., derived from one or more HIV isolate); and 

30 sequences encoding tumor antigens/epitopes. Further sequences may also be derived 
from non-viral sources, for instance, sequences encoding cytokines such interleukin-2 
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(IL-2), stem cell factor (SCF), interleukin 3 (IL-3), interleukin 6 (IL-6), interleukin 12 
(IL-12), G-CSF, granulocyte macrophage-colony stimulating factor (GM-CSF), 
interleukin-l alpha (IL-1 alpha), interleukin-1 1 (IL-1 1), MIP-1, tumor necrosis factor 
(TNF), leukemia inhibitory factor (LIF), c-kit ligand, thrombopoietin (TPO) and fltS 
5 ligand, commercially available from several vendors such as, for example, Genzyme 
(Framingham, MA), Genentech (South San Francisco, CA), Amgen (Thousand Oaks, 
CA), R&D Systems and hnmunex (Seattle, WA). Additional sequences are described 
herein below. 

HIV polypeptide coding sequences can be obtained from other HIV isolates, 
10 see, e.g., Myers et al. Los Alamos Database, Los Alamos National Laboratory, Los 
Alamos, New Mexico (1992); Myers et al.. Human Retroviruses and Aids, 1997, Los 
Alamos, New Mexico: Los Alamos National Laboratory. Synthetic expression 
cassettes can be generated using such coding sequences as starting material by 
following the teachings of the present specification. 
1 5 Further, the synthetic expression cassettes of the present invention include 

related polypeptide sequences having greater than 85%, preferably greater than 90%, 
more preferably greater than 95%, and most preferably greater than 98% sequence 
identity to the polypeptides encoded by the synthetic expression cassette sequences 
disclosed herein. 

20 Exemplary expression cassettes and modifications are set forth in Example 1 

and are discussed further herein below. 

Further, the polynucleotides of the present invention may comprise altemative 
polymer backbone structures such as, but not limited to, polyvmyl backbones (Pitha, 
Biochem Biophys Acta, 204:39, 1970a; Pitha, Biopolymers, 9:965, 1970b), and 

25 morpholino backbones (Summerton, J., et al, U.S. Patent No. 5,142,047, issued 

08/25/92; Summerton, J., et al, U.S. Patent No. 5,185,444 issued 02/09/93). A variety 
of other charged and uncharged polynucleotide analogs have been reported. 
Numerous backbone modifications are knovm in the art, including, but not limited to, 
uncharged linkages {e.g., methyl phosphonates, phosphotriesters, phosphoamidates, 

30 and carbamates) and charged linkages (e.g., phosphorothioates and 
phosphorodithioates. 
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2.3.3 Exemplary Cloning Vectors and Systems for Use with the 

Polynucleotide Sequences Encoding Immunogenic Polypeptides 
Polynucleotide sequences for use in the compositions and methods of the 
5 present invention can be obtained using recombinant methods, such as by screening 
cDNA and genomic libraries from cells expressing the gene, or by deriving the gene 
from a vector known to include the same. Furthermore, the desired gene can be 
isolated directly from cells and tissues containing the same, using standard techniques, 
such as phenol extraction and PCR of cDNA or genomic DNA. See, e.g., Sambrook 

10 et al., supra, for a description of techniques used to obtain and isolate DNA. The gene 
of interest can also be produced synthetically, rather than cloned. The nucleotide 
sequence can be designed with the appropriate codons for the particular amino acid 
sequence desired. In general, one will select preferred codons for the intended host in 
which the sequence will be expressed. The complete sequence is assembled from 

1 5 overlapping oUgonucleotides prepared by standard methods and assembled into a 
complete coding sequence. See, e.g., Edge, Nature (1981) 292:756; Nambair et al.. 
Science (1984) 223:1299; Jay et al., J. Biol. Chem. (1984) 259:631 1; Stemmer, 
W.P.C., (1995) 164:49-53. 

Next, the gene sequence encoding the desired antigen can be inserted into a 

20 vector containing a synthetic expression cassette of the present invention. In one 
embodiment, polynucleotides encoding selected antigens are sepju-ately cloned into 
expression vectors (e.g., a first Env-coding polynucleotide in a first vector, a second 
analogous Env-coding polynucleotide in a second vector). In certain embodiments, 
the antigen is inserted into or adjacent a synthetic Gag coding sequence such that 

25 when the combined sequence is expressed it resuUs in the production of VLPs 
comprising the Gag polypeptide and the antigen of interest, e.g., Env (native or 
modified) or other antigen(s) (native or modified) derived from HIV. Insertions can 
be made within the coding sequence or at either end of the coding sequence (5', amino 
terminus of the expressed Gag polypeptide; or 3', carboxy terminus of the expressed 

30 Gag polypeptide)(Wagner, R., et al., Arch Virol. 127:117-137, 1992; Wagner, R., et 
al., Virology 200:162-175, 1994; Wu, X., et al.,y. Virol. 69(6):3389-3398, 1995; 
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Wang, C-T., et al.. Virology 200:524-534, 1 994; Chazal, N., et al.. Virology 68(1): 1 1 1- 
122, 1994; Griffiths, J.C., et al., J. Virol. 67(6):3191-3198, 1993; Reicin, A.S., et al., 
J. Virol. 69(2):642-650, 1995). Up to 50% of the coding sequences of p55Gag can be 
deleted without affecting the assembly to virus-like particles and expression efficiency 
5 (Borsetti, A., et al, Virol. 72(11):93 13-9317, 1998; Gamier, L., et al., / Virol 
72(6):4667-4677, 1998; Zhang, Y.,etal.,/F/w/ 72(3): 1782- 1789, 1998; Wang, C, 
et al., J Virol 72(10): 7950-7959, 1998). When sequences are added to the amino 
terminal end of Gag, the polynucleotide can contain coding sequences at the 5' end 
that encode a signal for addition of a myristic moiety to the Gag-containing 

1 0 polypeptide (e.g., sequences that encode Met-Gly). 

Expression cassettes for use in the practice of the present invention can also 
include control elements operably linked to the coding sequence that allow for the 
expression of the gene in vivo in the subject species. For example, typical promoters 
for mammalian cell expression include the SV40 early promoter, a CMV promoter 

1 5 such as the CMV immediate early promoter, the mouse mammary tumor virus LTR 
promoter, the adenovirus major late promoter (Ad MLP), and the herpes simplex virus 
promoter, among others. Other nonviral promoters, such as a promoter derived from 
the murine metallothionein gene, will also find use for mammaUan expression. 
Typically, transcription termination and polyadenylation sequences will also be 

20 present, located 3' to the translation stop codon. Preferably, a sequence for 

optimization of initiation of translation, located 5' to the coding sequence, is also 
present. Examples of transcription terminator/polyadenylation signals include those 
derived from SV40, as described in Sambrook et al., supra, as well as a bovme growth 
hormone terminator sequence. 

25 Enhancer elements may also be used herein to increase expression levels of the 

mammaUan constructs. Examples include the SV40 early gene enhancer, as described 
in Dijkema et al., EMBO J. (1 985) 4:761, the enhancer/promoter derived from the long 
terminal repeat (LTR) of the Rous Sarcoma Virus, as described in Gorman et al, Proc. 
Natl. Acad. Sci. USA (1982b) l^.eni and elements derived from human CMV, as 

30 described in Boshart et al., Cell (1985) 41 :521, such as elements included in the CMV 
mtron A sequence. 
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Furthermore, plasmids can be constructed which include a chimeric antigen- 
coding gene sequences, encoding, e.g., multiple antigens/epitopes of interest, for 
example derived from more than one viral isolate. 

Typically the antigen coding sequences precede or follow the synthetic coding 
5 sequence and the chimeric transcription unit will have a single open reading frame 
encoding both the antigen of interest and the synthetic coding sequences. 
Alternatively, multi-cistronic cassettes (e.g., bi-cistronic cassettes) can be constructed 
allowing expression of multiple antigens from a single mRNA using the EMCV IRES, 
or the like. 

10 In one embodiment of the present invention, the polynucleotide component of 

an immune generating composition may comprise, for example, the following: a first 
expression vector comprising a first Env expression cassette, wherein the Env coding 
sequence is derived from a first HIV subtype, serotype, or strain, and a second 
expression vector comprising a second Env expression cassette, wherein the Env 

1 5 coding sequence is derived from a second HIV subtype, serotype, or strain. 

Expression cassettes comprising coding sequences of the present invention may be 
combined in any nimiber of combinations depending on the coding sequence products 
(e.g., HIV polypeptides) to which, for example, an immunological response is desired 
to be raised. In yet another embodiment, synthetic coding sequences for multiple 

20 HIV-derived polypeptides may be constructed into a polycistronic message under the 
control of a single promoter wherein IRES are placed adjacent the coding sequence for 
each encoded polypeptide. 

Exemplary polynucleotide sequences of interest for use in the present 
invention may be derived from strains including, but not limited to: Subtype B-SF162, 

25 Subtype C-TVl .8_2 (8_2_TV1_C.ZA), Subtype C-TVl .8_5 (8_5_TV1_C.ZA), 
Subtype C-TV2. 12-5/1 (12-5_1_TV2_C.ZA), Subtype C-MJ4, hidia Subtype C- 
93IN101, Subtype A-Q2317, Subtype D-92UG001, Subtype E-cm235, Subtype A 
HIV-1 isolate Q23-17 from Kenya GenBank Accession AF004885, Subtype A HIV-1 
isolate 98UA01 16 from Ukraine GenBank Accession AF413987, Subtype A HIV-1 

30 isolate SE853 8 from Tanzania GenBank Accession AF069669, Subtype A Human 
immunodeficiency virus 1 proviral DNA, complete genome, clone:pUG031-Al 
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GenBank Accession AB098330, Subtype D Human immunodeficiency virus type 1 
complete proviral genome, strain 92UG001 GenBank Accession AJ320484, Subtype 
D HIV-1 isolate 94UG1 14 from Uganda GenBank Accession U88824, Subtype D 
Human immunodeficiency virus type 1, isolate ELIGenBank Accession K03454, and 
5 Indian Subtype C Human immunodeficiency virus type 1 subtype C genomic RNA 
GenBank Accession AB023804. 

Polynucleotide coding sequences used in the present invention may encode 
functional gene products or be mutated to reduce (relative to wild-type), attenuate, 
inactivate, eliminate, or render non-fimctional the activity of the gene product(s) 

10 encoded the synthetic polynucleotide. 

Once complete, the expression cassettes are typically used in constructs for 
nucleic acid immunization using standard gene delivery protocols. Methods for gene 
delivery are known in the art. See, e.g., U.S. Patent Nos. 5,399,346, 5,580,859, 
5,589,466. Genes can be delivered either directly to the vertebrate subject or, 

1 5 alternatively, delivered ex vivo, to cells derived from the subject and the cells 
reimplanted in the subject. 

A number of viral based systems have been developed for gene transfer into 
mammalian cells. Selected sequences can be inserted into a vector and packaged in 
retroviral particles using techniques known in the art. The recombinant virus can then 

20 be isolated and delivered to cells of the subject either in vivo or ex vivo. A number of 
viral based systems have been developed for use as gene transfer vectors for 
mammaUan host cells. For example, retroviruses (in particular, lenti viral vectors) 
provide a convenient platform for gene delivery systems. A coding sequence of 
interest (for example, a sequence usefiil for gene therapy applications) can be inserted 

25 into a gene delivery vector and packaged in retroviral particles using techniques 

known in the art. Recombinant virus can then be isolated and delivered to cells of the 
subject either in vivo or ex vivo. A number of retroviral systems have been described, 
including, for example, the following: (U.S. Patent No. 5,219,740; Miller et al. (1989) 
BioTechniques 7:980; Miller, A.D. (1990) Human Gene Therapy 1:5; Scarpa et al. 

30 (1991) Virology 180:849; Bums et al. (1993) Proc. Natl. Acad. Sci. USA 90:8033; 
Boris-Lawrie et al. (1993) Cur. Opin. Genet. Develop. 3:102; GB 2200651; EP 
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0415731; EP 0345242; PCT International Publication No. WO 89/02468; PCT 
International Publication No. WO 89/05349; PCT International Publication No. WO 
89/09271; PCT International Publication No. WO 90/02806; PCT International 
Publication No. WO 90/07936; PCT International Publication No. WO 90/07936; PCT 
5 International Publication No. WO 94/03622; PCT International Publication No. WO 
93/25698; PCT International Publication No. WO 93/25234; PCT International 
Publication No. WO 93/11230; PCT International Publication No. WO 93/10218; PCT 
International Publication No. WO 91/02805; in U.S. 5,219,740; U.S. 4,405,712; U.S. 
4,861,719; U.S. 4,980,289 and U.S. 4,777,127; in U.S. Serial No. 07/800,921; and in 

10 Vile (1993) Cancer Res 53:3860-3864; Vile (1993) Cancer Res 53:962-967; Ram 
(1993) Cancer Res 53:83-88; Takamiya (1992) JNeurosci Res 33:493-503; Baba 
(1993) J Neurosurg 79:729-735] Mann (1983) Ce// 33:153; Cme (1984) Proc Natl 
AcadSci USA 81;6349; and Miller (1990) Human Gene Therapy \. 

One type of retrovirus, the murine leukemia virus, or "MLV", has been widely 

1 5 utilized for gene therapy applications (see generally Mann et al. (Cell 33:153, 1 993), 
Cane and Mulligan ^ProcJNTa/ 7. /Icflc?. Sci. C/&4 81:6349, 1984), and Miller etal., 
Human Gene Therapy 1:5-14,1990. 

Lentiviral vectors may be readily constructed from a wide variety of 
lentiviruses (see RNA Tumor Viruses, Second Edition, Cold Spring Harbor 

20 Laboratory, 1985). Representative examples of lentiviruses included HIV, HIV-l, 
HIV-2, FIV and SIV. Such lentiviruses may either be obtained from patient isolates, 
or, more preferably, fix>m depositories or collections such as the American Type 
Culture Collection, or isolated from known sources using available techniques. 
Portions of the lentiviral gene delivery vectors (or vehicles) may be derived from 

25 different viruses. For example, in a given recombinant lentiviral vector, LTRs may be 
derived from an HIV, a packaging signal from SIV, and an origin of second strand 
synthesis from HrV-2. Lentiviral vector constructs may comprise a 5' lentiviral LTR, 
a tRNA binding site, a packaging signal, one or more heterologous sequences, an 
origin of second strand DNA synthesis and a 3' LTR. The lentiviral vectors have a 

30 nuclear transport element that, in preferred embodiments is not RRE. Representative 
examples of suitable nuclear transport elements include the element in Rous sarcoma 
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virus (Ogert, et al., J ViroL 70, 3834-3843, 1996), the element in Rous sarcoma virus 
(Liu & Mertz. Genes & Dev., 9, 1766-1789, 1995) and the element in the genome of 
simian retrovirus type I (Zolotukhin, et al., / ViroL 68, 7944-7952, 1994). Other 
potential elements include the elements in the histone gene (Kedes, Annu. Rev. 
5 Biochem. 48, 837-870, 1970), interferon gene (Nagata et al. Nature 287, 401-408, 
1980), adrenergic receptor gene (Koilka, et al., Nature 329, 75-79, 1987), and the c- 
Jun gene (Hattorie, et al., Proc. Natl. Acad. Sci. (75^55,9148-9152,1988). 

A number of adenovirus vectors have also been described. Unlike retroviruses 
which integrate into the host genome, adenoviruses persist extrachromosomally thus 

1 0 minimizing the risks associated with insertional mutagenesis (Haj-Ahmad and 
Graham, J. Virol. (1986) 57:267-274; Bett et al., J. Virol. (1993) 67:5911-5921; 
Mittereder et al., Human Gene Therapy (1994) 5:717-729; Seth et al., J. Virol. (1994) 
68:933-940; Barr et al.. Gene Therapy (1994) 1:51-58; Berkner, K.L. BioTechniques 
(1988) 6:616-629; and Rich et al., Human Gene Therapy (1993) 4:461-476). 

1 5 Additionally, various adeno-associated virus (AAV) vector systems have been 

developed for gene delivery. AAV vectors can be readily constructed using 
techniques well known in the art. See, e.g., U.S. Patent Nos. 5,173,414 and 5,139,941; 
PCT International Publication Nos. WO 92/01070 (pubHshed 23 January 1992) and 
WO 93/03769 (published 4 March 1993); Lebkowski et al., Molec. Cell. Biol. (1988) 

20 8:3988-3996; Vincent et al., Vaccines 90 (1990) (Cold Spring Harbor Laboratory 
Press); Carter, B.J. Current Opinion in Biotechnology (1992) 3:533-539; Muzyczka, 
N. Current Topics in Microbiol, and Immunol. (1992) 158:97-129; Kotin, R.M. 
Human Gene Therapy (1994) 5:793-801; Shelling and Smith, Gene Therapy (1994) 
1:165-169; and Zhou et al., J. Exp. Med. (1994) 179:1867-1875. 

25 Another vector system useful for delivering the polynucleotides of the present 

invention is the enterically administered recombinant poxvirus vaccines described by 
Small, Jr., P.A., et al. (U.S. Patent No. 5,676,950, issued October 14, 1997). 

Additional viral vectors that will find use for delivering the nucleic acid 
molecules encoding the antigens of interest include those derived from the pox family 

30 of viruses, including vaccinia virus and avian poxvirus. By way of example, vaccinia 
virus recombinants expressing the genes can be constructed as follows. The DNA 
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encoding the particular immunogenic HIV polypeptide coding sequence is first 
inserted into an appropriate vector so that it is adjacent to a vaccinia promoter and 
flanking vaccinia DNA sequences, such as the sequence encoding thymidine kinase 
(TK). This vector is then used to transfect cells that are simultaneously infected with 
5 vaccinia. Homologous recombination serves to insert the vaccinia promoter plus the 
gene encoding the coding sequences of interest into the viral genome. The resulting 
TK recombinant can be selected by culturing the cells in the presence of 5- 
bromodeoxyiuidine and picking viral plaques resistant thereto. 

Alternatively, avipoxviruses, such as the fowlpox and canarypox viruses, can 

10 also be used to deliver the genes. Recombinant avipox viruses, expressing 

immunogens from mammalian pathogens, are known to confer protective immunity 
when administered to non-avian species. The use of an avipox vector is particularly 
desirable in human and other mammalian species since members of the avipox genus 
can only productively replicate in susceptible avian species and therefore are not 

15 infective in mammalian cells. Methods for producing recombinant avipoxviruses are 
known in the art and employ genetic recombination, as described above with respect 
to the productioii of vaccinia viruses. See, e.g., PCT International Publication Nos. 
WO 91/12882; WO 89/03429; and WO 92/03545. 

Molecular conjugate vectors, such as the adenovirus chimeric vectors 

20 described in Michael et al, J. Biol. Chem. (1993) 268:6866-6869 and Wagner et al., 
Proc. Natl. Acad. Sci. USA (1992) 89:6099-6103, can also be used for gene delivery. 

Members of the Alphavirus genus, such as, but not limited to, vectors derived 
from the Sindbis, Semliki Forest, and Venezuelan Equine Encephalitis viruses, will 
also find use as viral vectors for deUvering the polynucleotides of the present ^ 

25 invention (for example, first and second synthetic gpl40-polypeptide encoding 

expression cassette, wherein the first and second gpl40 polypeptides are analogous 
and derived from different HIV subtypes, serotypes, or strains). For a description of 
Sindbis-virus derived vectors useful for the practice of the instant methods, see, 
Dubensky et al., J. Virol. (1996) 70:508-519; and PCT International PubUcation Nos. 

30 WO 95/07995 and WO 96/1 7072; as well as, Dubensky, Jr., T.W., et al., U.S. Patent 
No. 5,843,723, issued December 1, 1998, and Dubensky, Jr., T.W., U.S. Patent No. 
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5,789,245, issued August 4, 1998. Preferred expression systems include, but are not 
limited to, eucaryotic layered vector initiation systems (e.g., US Patent No. 6,015,686, 
US Patent No. 5, 814,482, US Patent No. 6,015,694, US Patent No. 5,789,245, EP 
1029068A2, PCT International Publication No. WO 9918226A2/A3, EP 00907746A2, 
5 PCT International Publication No. WO 9738087A2). Exemplary expression systems 
include, but are not limited to, chimeric alphavirus replicon particles, for example, 
those that form VEE and SIN (see, e.g., Perri, et al., "An alphavirus replicon particle 
chimera derived from Venezuelan equine encephalitis and Sindbis viruses is a potent 
gene-based vaccine delivery vector," J. Virol 2003, 77(19), in press; PCT 

10 WO02/099035; USSN 10/310734, filed Dec 4 2002). Such alphavirus-based vector 
systems can be used in a prime or as a boost in DNA-primed subjects or potentially as 
a stand-alone immunization method for the induction of neutralizing antibodies using 
the multivalent approaches described herein. 

Expression cassette delivery vectors may also include tissue-specific promoters 

1 5 to drive expression of one or more genes or sequences of interest. 

Expression cassette delivery vector constructs may be generated such that 
more than one gene of interest is expressed. This may be accomplished through the 
use of di- or oligo-cistronic cassettes (e.g., where the coding regions are separated by 
8Q nucleotides or less, see generally Levin et al.. Gene 108:167-174, 1991), or through 

20 the use of Internal Ribosome Entry Sites ("IRES"). 

Synthetic expression cassettes of interest can also be delivered without a viral 
vector. For example, delivery of the expression cassettes of the present invention can 
also be accomplished using eucaryotic expression vectors comprising CMV -derived 
elements, such vectors include, but are not limited to, the following: pCMVKm2, 

25 pCMV-link pCMVPLEdhfi-, and pCMV6a (see Example 1 ). For example, a synthetic 
DNA expression cassette of the present invention, e.g., one encoding gpl40 
polypeptide, may be cloned into the following eucaryotic expression vectors: 
pCMVKm2, for transient expression assays and DNA immunization studies, the 
pCMVKin2 vector is derived firom pCMV6a (Chapman et al., Nuc. Acids Res. (1991) 

30 19:3979-3986) and comprises a kanamycin selectable marker, a ColEl origin of 

replication, a CMV promoter enhancer and Intron A, followed by an insertion site for 
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the synthetic sequences described below followed by a polyadenylation signal derived 
from bovine growth hormone ~ the pCMVKm2 vector differs from the pCMV-link 
vector only in that a polylinker site is inserted into pCMVKm2 to generate pCMV- 
link; pESN2dhfr and pCMVPLEdhfr, for expression in Chinese Hamster Ovary 
5 (CHO) cells; and, pAcCl 3, a shuttle vector for use in the Baculovirus expression 
system (pAcC13, is derived from pAcC12 which is described by Munemitsu S., et al., 
Mol Cell Biol. 10(ll):5977-5982, 1990). 

In additon, the expression cassettes of the present invention can be packaged in 
liposomes prior to delivery to the subject or to cells derived therefrom. Lipid 

10 encapsulation is generally accomplished using liposomes which are able to stably bind 
or entrap and retain nucleic acid. The ratio of condensed DNA to lipid preparation can 
vary but will generally be around 1 : 1 (mg DNA. micromoles lipid), or more of lipid. 
For a review of the use of liposomes as carriers for delivery of nucleic acids, see, Hug 
and Sleight, Biochim. Biophys. Acta. (1991) 1097 : 1-17; Straubinger et al., in Methods 

15 ofEnzymology (1983), Vol. 101, pp. 512-527. 

Liposomal preparations for use in the present invention include cationic 
(positively charged), anionic (negatively charged) and neutral preparations, with 
cationic liposomes particularly preferred. Cationic liposomes have been shown to 
mediate intracellular delivery of plasmid DNA (Feigner et al., Proc. Natl. Acad. Sci. 

20 USA (1987) 84:7413-7416); mRNA (Malone et al., Proc. Natl. Acad Sci. USA (1989) 
86:6077-6081); and purified transcription factors (Debs et al., J. Biol. Chem. (1990) 
265:10189-10192), in fimctional form. 

Cationic liposomes are readily available. For example, N[ 1-2,3- 
dioleyloxy)propyl]-N,N,N-triethylammonium (DOTMA) liposomes are available 

25 under the trademark Lipofectin, from GIBCO BRL, Grand Island, NY. (See, also, 
Feigner et al., Proc. Natl. Acad Sci. USA (1987) 84:7413-7416). Other commercially 
available lipids include (DDAB/DOPE) and DOTAP/DOPE (Boerhinger). Other 
cationic liposomes can be prepared from readily available materials using techniques 
well known in the art. See, e.g., Szoka et al., Proc. Natl. Acad. Sci. USA (1978) 

30 75:4194-4198; PCT International Publication No. WO 90/1 1092 for a description of 
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the synthesis of DOTAP (l,2-bis(oleoyloxy)-3-(trimethylaminomo)propane) 
liposomes. 

Similarly, anionic and neutral liposomes are readily available, such as, from 
Avanti Polar Lipids (Birmingham, AL), or can be easily prepared using readily 
5 available materials. Such materials include phosphatidyl choline, cholesterol, 
phosphatidyl ethanolamine, dioleoylphosphatidyl choline (DOPC), 
dioleoylphosphatidyl glycerol (DOPG), dioleoylphoshatidyl ethanolamine (DOPE), 
among others. These materials can also be mixed with the DOTMA and DOTAP 
starting materials in appropriate ratios. Methods for making liposomes using these 

1 0 materials are well known in the art. 

The liposomes can comprise multilammelar vesicles (MLVs), small 
unilamellar vesicles (SUVs), or large unilamellar vesicles (LUVs). The various 
liposome-nucleic acid complexes are prepared using methods known in the art. See, 
e.g., Straubinger et al., in METHODS OF IMMUNOLOGY (1983), Vol. 101, pp. 512- 

15 527; Szoka et al., Proc. Natl. Acad. Sci. USA (1978) 75:4194-4198; Papahadjopoulos 
et al., Biochim. Biophys. Acta (1975) 394:483; Wilson et al.. Cell (1979) 17:77); 
Deamer and Bangham, Biochim. Biophys. Acta (1976) 443:629; Ostro et al., Biochem. 
Biophys. Res. Commun. (1977) 76:836; Fraley et al, Proc. Natl. Acad. Sci. USA 
(1979) 76:3348); Enoch and Strittmatter, Proc. Natl. Acad Sci. USA (1979) 76:145); 

20 Fraley et al, J. Biol. Chem. (1980) 255:10431; Szoka and Papahadjopoulos, Proc 
Natl. Acad Sci. USA (1978) 75:145; and Schaefer-Ridder et al., Science (1982) 
215:166. 

The DNA and/or protein antigen(s) can also be delivered in cochleate lipid 
compositions similar to those described by Papahadjopoulos et al., Biochem. Biophys. 
25 Acta. (1975) 394:483-491. See, also, U.S. Patent Nos. 4,663,161 and 4,871,488. 

The expression cassettes of interest may also be encapsulated, adsorbed to, or 
associated with, particulate carriers. Such carriers present multiple copies of a 
selected antigen to the immune system and promote trapping and retention of antigens 
in local lymph nodes. The particles can be phagocytosed by macrophages and can 
30 enhance antigen presentation through cytokine release. Examples of particulate 
carriers include those derived from polymethyl methacrylate polymers, as well as 
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microparticles derived from poly(lactides) and poly(lactide-co-glycolides), known as 
PLG. See, e.g., Jeffery et al., Pharm. Res. (1993) 10:362-368; McGee JP, et al., J 
Microencapsul. 14(2):197-210, 1997; O'HaganDT, et al., Vaccine ll(2):149-54, 
1993. Suitable microparticles may also be manufactured in the presence of charged 
5 detergents, such as anionic or cationic detergents, to yield microparticles with a 
surface having a net negative or a net positive charge. For example, microparticles 
manufactured with anionic detergents, such as hexadecyltrimethylammonium bromide 
(CTAB), i.e. CTAB-PLG microparticles, adsorb negatively charged macromolecules, 
such as DNA. (see, e.g., Int'l AppHcation Number PCTAJS99/17308). 

1 0 Furthermore, other particulate systems and polymers can be used for the in 

vivo or ex vivo delivery of the gene of interest. For example, polymers such as 
polylysine, polyarginine, polyomithine, spermine, spermidine, as well as conjugates of 
these molecules, are useful for transferring a nucleic acid of interest. Similarly, DEAE 
dextran-mediated transfection, calcium phosphate precipitation or precipitation using 

15 other insoluble inorganic salts, such as strontium phosphate, aluminum silicates 

including bentonite and kaolin, chromic oxide, magnesium silicate, talc, and the like, 
will find use with the present methods. See, e.g., Feigner, P.L., Advanced Drug 
Delivery Reviews (1990) 5:163-187, for a review of delivery systems useful for gene 
transfer. Peptoids (Zuckerman, R.N., et al, U.S. Patent No. 5,831,005, issued 

20 November 3, 1998) may also be used for delivery of a construct of the present 
invention. 

In some embodiments of the present invention, alum and PLG are useful 
delivery adjuvants that enhance immunity to polynucleotide vaccines (e.g., DNA 
vaccines). Further embodiments include, but are not limited to, toxoids, cytokines, 

25 and co-stimulatory molecules may also be used as genetic adjuvants with 
polynucleotide vaccines. 

Additionally, biolistic delivery systems employing particulate carriers such as 
gold and tungsten, are especially useful for delivering synthetic expression cassettes of 
the present invention. The particles are coated with the synthetic expression 

30 cassette(s) to be delivered and accelerated to high velocity, generally imder a reduced 
atmosphere, using a gim powder discharge from a "gene gun." For a description of 
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such techniques, and apparatuses useful therefore, see, e.g., U.S. Patent Nos. 
4,945,050; 5,036,006; 5,100,792; 5,179,022; 5,371,015; and 5,478,744. Also, needle- 
less injection systems can be used (DaviSj H.L., et al, Vaccine 12:1503-1509, 1994; 
Bioject, Inc., Portland, OR). 
5 Recombinant vectors carrying a synthetic expression cassette of the present 

invention are formulated into compositions for deUvery to the vertebrate subject. 
These compositions may either be prophylactic (to prevent infection) or therapeutic (to 
treat disease after infection). If prevention of disease is desired, the compositions are 
generally administered prior to primary infection with the pathogen of interest. If 

1 0 treatment is desired, e.g., the reduction of symptoms or recurrences, the compositions 
are generally administered subsequent to primary infection. The compositions will 
comprise a "therapeutically effective amount" of the gene of interest such that an 
amount of the antigen can be produced in vivo so that an immune response is 
generated in the individual to which it is administered. The exact amount necessary 

1 5 will vary depending on the subject being treated; the age and general condition of the 
subject to be treated; the capacity of the subject's immune system to synthesize 
antibodies; the degree of protection desired; the severity of the condition being treated; 
the particular antigen selected and its mode of administration, among other factors. 
An appropriate effective amount can be readily determined by one of skill in the art. 

20 Thus, a "therapeutically effective amount" will fall in a relatively broad range that can 
be determined through routine trials. 

The compositions will generally include one or more "pharmaceutically 
acceptable excipients or vehicles" such as water, saline, glycerol, polyethyleneglycol, 
hyaluronic acid, ethanol, etc. Additionally, auxiliary substances, such as wetting or 

25 emulsifying agents, pH buffering substances, and the like, may be present in such 
vehicles. Certain facilitators of nucleic acid uptake and/or expression can also be 
included in the compositions or coadministered, such as, but not hmited to, 
bupivacaine, cardiotoxin and sucrose. 

Once formulated, the compositions of the invention can be administered 

30 directly to the subject (e.g., as described above) or, alternatively, delivered ex vivo, to 
cells derived from the subject, using methods such as those described above. For 
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example, methods for the ex vivo dehvery and reimplantation of transformed cells into 
a subject are known in the art and can include, e.g., dextran-mediated transfection, 
calcium phosphate precipitation, polybrene mediated transfection, lipofectamine and 
LT-1 mediated transfection, protoplast fusion, electroporation, encapsulation of the 
5 polynucleotide(s) (with or without the corresponding antigen) in liposomes, and direct 
microinjection of the DNA into nuclei. 

Direct delivery of synthetic expression cassette compositions in vivo will 
generally be accompUshed with or without viral vectors, as described above, by 
injection using either a conventional syringe or a gene gun, such as the Accell® gene 

10 delivery system (PowderJect Technologies, Inc., Oxford, England). The constructs 
can be injected either subcutaneously, epidermally, intradermally, intramucosally such 
as nasally, rectally and vaginally, intraperitoneally, intravenously, orally or 
intramuscularly. Dehvery of DNA into cells of the epidermis is particularly preferred 
as this mode of administration provides access to skin-associated lymphoid cells and 

15 provides for a transient presence ofDNA in the recipient. Other modes of 

administration include oral and pulmonary administration, suppositories, needle-less 
injection, transcutaneous and transdermal applications. Dosage treatment may be a 
single dose schedule or a multiple dose schedule. Administration of polypeptides 
encoding immunogenic polypeptides is combined with administration of analogous 

20 immunogenic polypeptides following the methods of the present invention. 

2.3.4 Expression of Synthetic Sequences Encoding HIV-1 Polypeptides 
AND Related Polypeptides 

Immunogenic viral polypeptide-encoding sequences of the present invention 
25 can be cloned into a number of different expression vectors/host cell systems to 
provide immunogenic polypeptides for the polypeptide component of the immune- 
response generating compositions of the present invention. For example, DNA 
fragments encoding HIV polypeptides can be cloned into eucaryotic expression 
vectors, including, a transient expression vector, CMV-promoter-based mammalian 
30 vectors, and a shuttle vector for use in baculovirus expression systems. Synthetic 
polynucleotide sequences (e.g., codon optimized polynucleotide sequences) and wild- 
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type sequences can typically be cloned into the same vectors. Numerous cloning 
vectors ars known to those of skill in the art, and the selection of an appropriate 
cloning vector is a matter of choice. See, generally, Sambrook et al, supra. The 
vector is then used to transform an appropriate host cell. Suitable recombinant 
5 expression systems include, but are not limited to, bacterial, mammalian, 

baculovirus/insect, vaccinia, Semliki Forest virus (SFV), Alphaviruses (such as, 
Sindbis, Venezuelan Equine Encephalitis (VEE)), mammalian, yeast and Xenopus 
expression systems, well known in the art. Particularly preferred expression systems 
are mammahan cell lines, vaccinia, Sindbis, eucaryotic layered vector initiation 

10 systems (e.g., US Patent No. 6,015,686, US Patent No. 5, 814,482, US Patent No. 
6,015,694, US Patent No. 5,789,245, EP 1029068A2, PCT Litemational Publication 
No. WO 9918226A2/A3, EP 00907746A2, PCT International Publication No. WO 
973 8087 A2), insect and yeast systems. 

A number of host cells for such expression systems are also known in the art. 

1 5 For example, mammalian cell lines are known in the art and include immortalized cell 
lines available from the American Type Culture Collection (A.T.C.C.), such as, but 
not limited to, Chinese hamster ovary (CHO) cells, HeLa cells, baby hamster kidney 
(BHK) ceils, monkey kidney cells (COS), as well as others. Similarly, bacterial hosts 
such as E. coli. Bacillus subtilis, and Streptococcus spp., wiW find use with the present 

20 expression constructs. Yeast hosts useful in the present invention include inter alia, 
Saccharomyces cerevisiae, Candida albicans, Candida maltosa, Hansenula 
polymorpha, Kluyveromyces fragilis, Kluyveromyces lactis, Pichia guillerimondii, 
Pichia pastoris, Schizosaccharomyces pombe and Yarrowia lipolytica. Insect cells for 
use with baculovirus expression vectors include, inter alia, Aedes aegypti, Autographa 

25 californica, Bombyx mori, Drosophila melanogaster, Spodoptera frugiperda, and 

Trichoplusia ni. See, e.g.. Summers and Smith, Texas Agricultural Experiment Station 
Bulletin No. 1555 {mi). 

Viral vectors can be used for expression of polypeptides in eucaryotic cells, 
such as those derived from the pox family of viruses, including vaccinia virus and 

30 avian poxvirus. For example, a vaccinia based infection/transfection system, as 

described in Tomei et al., J. Virol. (1993) 67:4017-4026 and Selby et al., J. Gen. Virol. 
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(1993) 74:1103-11 13, will also find use with the present invention. A vaccinia based 
infection/transfection system can be conveniently used to provide for inducible, 
transient expression of the coding sequences of interest in a host cell. In this system, 
cells are first infected in vitro with a vaccinia virus recombinant that encodes the 
5 bacteriophage 17 RNA polymerase. This polymerase displays exquisite specificity in 
that it only transcribes templates bearing T7 promoters. Following infection, cells are 
transfected with the polynucleotide of interest, driven by a T7 promoter. The 
polymerase expressed in the cytoplasm fi-om the vaccinia virus recombinant 
transcribes the transfected DNA into RNA that is then translated into protein by the 

1 0 host translational machinery. The method provides for high level, transient, 

cytoplasmic production of large quantities of RNA and its translation products. See, 
e.g., Elroy-Stein and Moss, Proc. Natl. Acad. Sci. USA (1990) 87:6743-6747; Fuerst et 
al., Proc. Natl. Acad. Sci. USA (1986) 83:8122-8126. 

As an alternative approach to infection with vaccinia or avipox virus 

1 5 recombinants, an amplification system can be used that will lead to high level 

expression following introduction into host cells. Specifically, a T7 RNA polymerase 
promoter preceding the coding region for T7 RNA polymerase can be engineered. 
Translation of RNA derived firom this template will generate T7 RNA polymerase 
which in turn will transcribe more template. Concomitantly, there will be a cDNA 

20 whose expression is under the control of the T7 promoter. Thus, some of the T7 RNA 
polymerase generated from translation of the amplification template RNA will lead to 
transcription of the desired gene. Because some T7 RNA polymerase is required to 
mitiate the amplification, T7 RNA polymerase can be introduced into cells along with 
the template(s) to prime the transcription reaction. The polymerase can be introduced 

25 as a protein or on a plasmid encoding the RNA polymerase. For a further discussion 
of T7 systems and their use for transforming cells, see, e.g., PCT International 
Publication No. WO 94/269 1 1 ; Studier and Moffatt, J. Mol. Biol. ( 1 986) 189: 113-1 30; 
Deng and Wolff, Gene (1994) 143:245-249; Gao et al, Biochem. Biophys. Res. 
Commun. (1994) 200:1201-1206; Gao and Huang, Nuc. Acids Res. (1993) 21:2867- 

30 2872; Chen et al., Nuc. Acids Res. (1994) 22:21 14-2120; and U.S. Patent No. 
5,135,855. 
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These vectors are transfected into an appropriate host cell. The cell lines are 
then cultured under appropriate conditions and the levels of any appropriate 
polypeptide product can be evaluated in supematants. For example, p24 can be used to 
evaluate Gag expression; gpl60, gpl40 or gpl20 can be used to evaluate Env 
5 expression; p6pol can be used to evaluate Pol expression; prot can be used to evaluate 
protease; pi 5 for RNAseH; p31 for Integrase; and other appropriate polypeptides for 
Vif, Vpr, Tat, Rev, Vpu and Nef 

Further, modified polypeptides can also be used, for example, other Env 
polypeptides include, but are not limited to, for example, native gpl60, oligomeric 
10 gpl40, monomeric gpI20 as well as modified and/or synthetic sequences of these 
polypeptides. 

Western Blot analysis can be used to show that cells containing the synthetic 
expression cassette produce the expected protein, typically at higher per-cell 
concentrations than cells containing the native expression cassette. The HFV proteins 

1 5 can be seen in both cell lysates and supematants. 

Fractionation of the supematants fi-om mammalian cells transfected with the 
synthetic expression cassette can be used to show that the cassettes provide superior 
production of HIV proteins and relative to the wild-type sequences. 

Efficient expression of these HIV-containing polypeptides in mammalian cell 

20 lines provides the following benefits: the polypeptides are free of baculovirus 

contaminants; production by established methods approved by the FDA; increased 
purity; greater yields (relative to native coding sequences); and a novel method of 
producing the Sub HIV-containing polypeptides in CHO cells which is not feasible in 
the absence of the increased expression obtained using the constructs of the present 

25 invention. Exemplary Manraialian cell lines include, but are not limited to, BHK, 
VERO, HT1080, 293, 293T, RD, COS-7, CHO, Jurkat, HUT, SUPT, C8166, 
MOLT4/clone8, MT-2, MT-4, H9, PMl, CEM, and CEMX174 (such cell lines are 
available, for example, from the A.T.C.C.). 

The desired polypeptide encoding sequences can be cloned into any number of 

3 0 commercially available vectors to generate expression of the polypeptide in an 

appropriate host system. These systems include, but are not limited to, the following: 
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baculovirus expression {Reilly, P.R., et al, Baculovirus Expression Vectors: A 
Laboratory Manual (1992); Beames, et al, Biotechniques 11:378 (1991); 
Pharmingen; Clontech, Palo Alto, CA)}, vaccinia expression {Earl, P. L., et al., 
"Expression of proteins in mammalian cells using vaccinia" In Current Protocols in 
5 Molecular Biology (F. M. Ausubel, et al. Eds.), Greene Publishing Associates & 

Wiley Interscience, New York (1991); Moss, B., et al., U.S. Patent Number 5,135,855, 
issued 4 August 1992}, expression in bacteria {Ausubel, P.M., et al.. Current 
Protocols in Molecular Biology. John Wiley and Sons, Inc., Media PA; 
Clontech}, expression in yeast {Rosenberg, S. and Tekamp-Olson, P., U.S. Patent No. 

10 RE35,749, issued, March 17, 1998; Shuster, J.R., U.S. Patent No. 5,629,203, issued 
May 13, 1997; GelUssen, G., et al, Antonie Van Leeuwenhoek, 62(l-2):79-93 (1992); 
Romanes, M.A., et al. Yeast 8(6):423-488 (1992); Goeddel, D.V., Methods in 
Enzymology 185 (1990); Guthrie, C, and G.R. Fink, Methods in Enzymology 194 
(1991)}, expression in mammalian cells (Clontech; Gibco-BRL, Ground Island, NY; 

1 5 e.g., Chinese hamster ovary (CHO) cell lines (Haynes, J., et al, Nuc. Acid. Res. 
11:687-706 (1983); 1983, Lau, Y.F., etal,Mol Cell Biol 4:1469-1475 (1984); 
Kaufinan, R. J., "Selection and coamplification of heterologous genes in mammalian 
cells," va. Methods in Enzymology, vol. 185, pp537-566. Academic Press, Inc., San 
Diego CA (1991)}, and expression in plant cells {plant cloning vectors, Clontech 

20 Laboratories, Inc., Palo Alto, CA, and Pharmacia LKB Biotechnology, Inc., 

Pistcataway, NJ; Hood, E., et al, J. Bacterial. 168:1291-1301 (1986); Nagel, R., et al. 
FEMS Microbiol Lett. 67:325 (1990); An, et al, "Binary Vectors", and others in 
Plant Molecular Bioloev Manual A3:l-19 (1988); Miki, B.L.A., et al. pp.249-265, 
and others in Plant DNA Infectious Agents (Hohn, T., et al, eds.) Springer- Verlag, 

25 Wien, Austria, (1987); Plant Molecular Biology: Essential Techniques, P.G. Jones 
and J.M. Sutton, New York, J. Wiley, 1997; Miglani, Gurbachan Dictionary of Plant 
Genetics and Molecular Biology, New York, Food Products Press, 1998; Henry, R. J., 
Practical Applications of Plant Molecular Biology, New York, Chapman & Hall, 
1997}. 

30 In addition to the mammalian, insect, and yeast vectors, the synthetic 

expression cassettes of the present invention can be incorporated into a variety of 
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expression vectors using selected expression control elements. Appropriate vectors 
and control elements for any given cell can be selected by one having ordinary skill in 
the art in view of the teachings of the present specification and information known in 
the art about expression vectors. 
5 For example, a synthetic coding sequence can be inserted into a vector that 

includes control elements operably linked to the desired coding sequence, which allow 
for the expression of the coding sequence in a selected cell-type. For example, typical 
promoters for mammahan cell expression include the SV40 early promoter, a CMV 
promoter such as the CMV immediate early promoter (a CMV promoter can include 

1 0 intron A), RSV, HIV-Ltr, the mouse mammary tumor virus LTR promoter (MMLV- 
Itr), the adenovirus major late promoter (Ad MLP), and the herpes simplex virus 
promoter, among others. Other nonviral promoters, such as a promoter derived from 
the murine metallothionein gene, will also find use for mammalian expression. 
Typically, transcription termination and polyadenylation sequences will also be 

15 present, located 3' to the translation stop codon. Preferably, a sequence for 

optimization of initiation of translation, located 5' to the coding sequence, is also 
present. Examples of transcription terminator/polyadenylation signals include those 
derived from SV40, as described in Sambrook, et al., supra, as well as a bovine 
growth hormone terminator sequence. Introns, containing splice donor and acceptor 

20 sites, may also be designed into the constructs for use with the present invention 
(Chapman et al., Nuc. Acids Res. (1991) 19:3979-3986). 

Enhancer elements may also be used herein to increase expression levels of the 
mammalian constructs. Examples include the SV40 early gene enhancer, as described 
in Dijkema et al., EMBO J. (1985) 4:761, the enhancer/promoter derived from the long 

25 terminal repeat (LTR) of the Rous Sarcoma Virus, as described in Gorman et al., Proc. 
Natl. Acad. Sci. USA (1982b) 79:6777 and elements derived from human CMV, as 
described in Boshart et al.. Cell (1985) 41:521, such as elements included in the CMV 
intron A sequence (Chapman et al., Nuc. Acids Res. (1991) 19:3979-3986). 

Also included in the invention are expression cassettes, comprising coding 

30 sequences and expression confrol elements that allow expression of the coding regions 
in a suitable host. The control elements generally include a promoter, translation 
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initiation codon, and translation and transcription termination sequences, and an 
insertion site for introducing the insert into the vector. Translational control elements 
useflil in expression of the polypeptides of the present invention have been reviewed 
by M. Kozak (e.g., Kozak, M., Mamm. Genome 7(8):563-574, 1996; Kozak, M., 
5 Biochimie 76(9):815-821, 1994; Kozak, U.J Cell Biol 108(2):229-241, 1989; Kozak, 
M., and Shatkin, A.J., Methods Enzymol 60:360-375, 1979). 

Expression in yeast systems has the advantage of commercial production. 
Recombinant protein production by vaccinia and CHO cell lines have the advantage of 
being mammalian expression systems. Further, vaccinia virus expression has several 

10 advantages including the following: (i) its wide host range; (ii) faithful post- 

transcriptional modification, processing, folding, transport, secretiori, and assembly of 
recombinant proteins; (iii) high level expression of relatively soluble recombinant 
proteins; and (iv) a large capacity to accommodate foreign DNA. 

The recombinantly expressed polypeptides from immunogenic HIV 

1 5 polypeptide-encoding expression cassettes are typically isolated from lysed cells or 
culture media. Purification can be carried out by methods known in the art including 
salt fractionation, ion exchange chromatography, gel filtration, size-exclusion 
chromatography, size-fractionation, and affinity chromatography, hnmunoaffinity 
chromatography can be employed using antibodies generated based on, for example, 

20 HIV antigens. Isolation of oligomeric forms of HIV envelope protein has been 
previously described (see, e.g., PCI International Application No. WO/00/39302), 

Advantages of expressing the proteins of the present invention using 
mammaUan cells include, but are not limited to, the following: well-established 
protocols for scale-up production; cell lines are suitable to meet good manufacturing 

25 process (GMP) standards; culture conditions for mammaUan cells are known in the art. 



2.3.5 immunogenicity enhancing components for use with the 
Polypeptide Component of the Present Invention 
Compositions of the present invention for generating an immune response in a 
30 mammal, for example, comprising a polynucleotide component and a polypeptide 

component, can include various excipients, adjuvants, carriers, auxiliary substances, 
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modulating agents, and the like. The polypeptide component of the compositions of 
the present invention include an amount of the polypeptide sufficient to mount an 
immunological response. An appropriate effective amount can be determined by one 
of skill in the art. 

5 The polypeptide component may comprise a carrier wherein the carrier is a 

molecule that does not itself induce the production of antibodies harmful to the 
individual receiving the composition. Suitable carriers are typically large, slowly 
metabolized macromolecules such as proteins, polysaccharides, polylactic acids, 
polyglycoUic acids, polymeric amino acids, amino acid copolymers, lipid aggregates 

10 (such as oil droplets or liposomes), and inactive virus particles. Examples of 

particulate carriers include those derived from polymethyl methacrylate polymers, as 
well as microparticles derived from poly(lactides) and poly(lactide-co-glycolides), 
known as PLG. See, e.g., Jeffery et al., Pharm. Res. (1993) 10:362-368; McGee JP, et 
2X.,JMicroencapsul. 14(2): 197-210, 1997; O'HaganDT, et al.. Vaccine ll(2):149-54, 

15 1993. Such carriers are well known to those of ordinary skill in the art. Additionally, 
these carriers may ftmction as immunostimulating agents ("adjuvants"). Furthermore, 
the antigen may be conjugated to a bacterial toxoid, such as toxoid from diphtheria, 
tetanus, cholera, etc., as well as toxins derived from E. coli. 

Adjuvants may also be used to enhance the effectiveness of the compositions. 

20 Such adjuvants include, but are not limited to: (1) aluminum salts (alum), such as 
almninum hydroxide, aluminum phosphate, aluminum sulfate, etc.; (2) oil-in- water 
emulsion formulations (with or without other specific immunostimulating agents such 
as muramyl peptides (see below) or bacterial cell wall components), such as for 
example (a) MF59 (PCT hitemational Publication No. WO 90/14837), containing 5% 

25 Squalene, 0.5% Tween 80, and 0.5% Span 85 (optionally containing various amounts 
of MTP-PE (see below), although not required) formulated into submicron particles 
using a microfluidizer such as Model 1 lOY microfluidizer (Microfluidics, Newton, 
MA), (b) SAP, containing 10% Squalane, 0.4% Tween 80, 5% pluronic-blocked 
polymer L121, and thr-MDP (see below) either microfluidized into a submicron 

30 emulsion or vortexed to generate a larger particle size emulsion, and (c) Ribi™ 

adjuvant system (RAS), (Ribi Immunochem, Hamilton, MT) containing 2% Squalene, 
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0.2% Tween 80, and one or more bacterial cell wall components from the group 
consisting of monophosphorylipid A (MPL), trehalose dimycolate (TDM), and cell 
wall skeleton (CWS), preferably MPL + CWS (DetoxTW); (3) saponin adjuvants, such 
as Stimulon™ (Cambridge Bioscience, Worcester, MA) may be used or particle 
5 generated therefrom such as ISCOMs (immunostimulating complexes); (4) Complete 
Freunds Adjuvant (CFA) and hicomplete Freunds Adjuvant (IF A); (5) cytokines, such 
as interleukins (IL-1, IL-2, etc.), macrophage colony stimulating factor (M-CSF), 
tumor necrosis factor (TNF), etc.; (6) oligonucleotides or polymeric molecules 
encoding immunostimulatory CpG motifs (Davis, H.L., et al., J, Immunology 

10 160:870-876, 1998; Sato, Y. et al.. Science 273:352-354, 1996) or complexes of 
antigens/oligonucleotides {Polymeric molecules include double and single stranded 
RNA and DNA, and backbone modifications thereof, for example, methylphosphonate 
linkages; or (7) detoxified mutants of a bacterial ADP-ribosylating toxin such as a 
cholera toxin (CT), a pertussis toxin (PT), or an E. coli heat-labile toxin (LT), 

1 5 particularly LT-K63 (where lysine is substituted for the wild-type amino acid at 
position 63) LT-R72 (where arginine is substituted for the wild-type amino acid at 
position 72), CT-S109 (where serine is substituted for the wild-type amino acid at 
position 109), and PT-K9/G129 (where lysine is substituted for the wild-type amino 
acid at position 9 and glycine substituted at position 129) (see, e.g., PCT hitemational 

20 Publication Nos. WO/93/13202 and WO/92/19265); (8) Muramyl peptides include, 
but are not limited to, N-acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP), N- 
acteyl-normuramyl-L-alanyl-D-isogliiatme(norTMDP),N-acetylmuramyl-L-alanyl-D- 
isogluatminyl-L-alanine-2-(r-2'-dipalmitoyl-sn-glycero-3-huydroxyphosphoryloxy)- 
ethylamine (MTP-PE), etc.; (9) Iscomatrix (CSL Limited,Victoria, Australia; also, see, 

25 e.g., Morein B, Bengtsson KL, "hnmunomodulation by iscoms, immune stimulating 
complexes," Methods. Sep;19(l):94-102, 1999) and (10) other substances that act as 
immunostimulating agents to enhance the effectiveness of the composition (e.g.. Alum 
and CpG oligonucleotides). 

Preferred adjuvants include, but are not limited to, MF59 and Iscomatrix. 

30 Dosage treatment with the polypeptide component of the immune stimulating 

compositions of the present invention may be a single dose schedule or a multiple 
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dose schedule. A multiple dose schedule is one in which a primary course of 
vaccination may be with 1-10 separate doses, followed by other doses given at 
subsequent time intervals, chosen to maintain and/or reinforce the inmiune response, 
for example at 1-4 months for a second dose, and if needed, a subsequent dose(s) after 
5' several months. The dosage regimen will also, at least in part, be determined by the 
need of the subject and be dependent on the judgment of the practitioner. 

Direct delivery of the polypeptide component of the immune-response 
generating compositions of the present invention is generally accompUshed, with or 
without adjuvants, by injection using either a conventional syringe or a gene gun, such 

10 as the Accell® gene delivery system (PowderJect Technologies, Inc., Oxford, 
England). The polypeptides can be injected either subcutaneously, epidermally, 
intradermally, intramucosally such as nasally, rectally and vaginally, intraperitoneally, 
intravenously, orally or intramuscularly. Other modes of administration include oral 
and pulmonary administration, suppositories, and needle-less injection. Dosage 

1 5 treatment may be a single dose schedule or a multiple dose schedule. Administration 
of polypeptides may also be combined with administration of adjuvants or other 
substances. 

2.3.6 IMIMUNOMODKLATORV MOLECULES 

20 In some embodiments of the present invention, gene transfer vectors can be 

constructed to encode a cytokine or other immunomodulatory molecule. For example, 
nucleic acid sequences encoding native lL-2 and gamma-interferon can be obtained as 
described in US Patent Nos. 4,738,927 and 5,326,859, respectively, while useful 
muteins of these proteins can be obtained as described in U.S. Patent No. 4,853,332. 

25 Nucleic acid sequences encoding the short and long forms of mCSF can be obtained as 
described in US Patent Nos. 4,847,201 and 4,879,227, respectively. In particular 
aspects of the invention, retroviral vectors expressing cytokine or immunomodulatory 
genes can be produced (e.g., PCT International Publication No. WO/94/02951, entitled 
"Compositions and Methods for Cancer Immunotherapy). 

30 Examples of suitable immunomodulatory molecules for use herein include the 

following: IL-l and IL-2 (Karupiah et al. (1990) J. Immunology 144:290-298, Weber 
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et al. (1987) J. Exp. Med. 166:1716-1733, Gansbacher et al. (1990) J. Exp. Med. 
172:1217-1224, and U.S. Patent No. 4,738,927); IL-3 and IL-4 (Tepper et al. (1989) 
Cell 57:503-512, Golumbek et al. (1991) Science 254:713-716, and U.S. Patent No. 
5,017,691); IL-5 and IL-6 (Brakenhof et al. (1987) J. Immunol. 139:4116-4121, and 
5 PCT International Publication No. WO 90/06370); IL-7 (U.S. Patent No. 4,965,195); 
IL-8, lL-9, IL-10, IL-11, IL-12, and IL-13 {Cytokine Bulletin, Summer 1994); IL-14 
and IL-15; alpha interferon (Pinter et al. (1991) Drugs 42:749-765, U.S. Patent Nos. 
4,892,743 and 4,966,843, PCT International Publication No. WO 85/02862, Nagata et 
al. (1980) Nature 284:316-320, Familletti et al. (\m) Methods in Enz. 78:387-394, 

10 Twu et al. (1989) Proc. Natl Acad. Sci. USA 86:2046-2050, and Faktor et al. (1990) 
Oncogene 5:867-872); beta-interferon (Seif et al. (1991) J. Virol. 65:664-671); 
gamma-interferons (Radford et al. (1991) The American Society ofHepatology 
20082015, Watanabe et al. (1989) Proc. Natl Acad Sci. USA 86:9456-9460, 
Gansbacher et al. (1990) Cancer Research 50:7820-7825, Maio et al. (1989) Can. 

15 Immunol Immunother. 30:34-42, and U.S. Patent Nos. 4,762,791 and 4,727,138); G- 
CSF (U.S. Patent Nos. 4,999,291 and 4,810,643); GM-CSF (PCT International 
Publication No. WO 85/04188). 

Immunomodulatory factors may also be agonists, antagonists, or ligands for 
these molecules. For example, soluble forms of receptors can often behave as 

20 antagonists for these types of factors, as can mutated forms of the factors themselves. 

Nucleic acid molecules that encode the above-described substances, as well as 
other nucleic acid molecules that are advantageous for use within the present 
invention, may be readily obtained from a variety of sources, including, for example, 
depositories such as the American Type Cultiire Collection, or from commercial 

25 sources such as British Bio- Technology Limited (Cowley, Oxford England). 

Representative examples include BBG 12 (containing the GM-CSF gene coding for 
the mature protein of 127 amino acids), BBG 6 (which contains sequences encoding 
gamma interferon), A.T.C.C. Deposit No. 39656 (which contains sequences encoding 
TNF), A.T.C.C. Deposit No. 20663 (which contains sequences encoding alpha- 

30 interferon), A.T.C.C. Deposit Nos. 31902, 31902 and 39517 (which contain sequences 
encoding beta-interferon), A.T.C.C. Deposit No. 67024 (which contains a sequence 
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which encodes Interleukin-lb), A.T.C.C. Deposit Nos. 39405, 39452, 39516, 39626 
and 39673 (which contain sequences encoding Interleukin-2), A.T.C.C. Deposit Nos. 
59399, 59398, and 67326 (which contain sequences encoding Interleukin-3), A.T.C.C. 
Deposit No. 57592 (which contains sequences encoding Literleukin-4), A.T.C.C. 
5 Deposit Nos. 59394 and 59395 (which contain sequences encoding Interleukin-5), and 
A.T.C.C. Deposit No. 67153 (which contains sequences encoding Interleukin-6). 

Plasmids containing cytokine genes or immunomodulatory genes (PCT 
Litemational Publication Nos. WO 94/02951 and WO 96/21015) can be digested with 
appropriate restriction enzymes, and DNA fragments containing the particular gene of 

10 interest can be inserted into a gene transfer vector using standard molecular biology 
techniques. (See, e.g., Sambrook et al., supra., or Ausubel et al. (eds) Current 
Protocols in Molecular Biology, Greene Publishing and Wiley-Interscience). 

Polynucleotide sequences coding for the above-described molecules can be 
obtained using recombinant methods, such as by screening cDNA and genomic 

1 5 libraries from cells expressing the gene, or by deriving the gene from a vector knovm 
to include the same. For example, plasmids that contain sequences that encode altered 
cellular products may be obtained from a depository such as the A.T.C.C, or from 
commercial sources. Plasmids containing the nucleotide sequences of interest can be 
digested with appropriate restriction enzymes, and DNA fragments containing the 

20 nucleotide sequences can be inserted into a gene transfer vector using standard 
molecular biology techniques. 

Alternatively, cDNA sequences for use with the present invention may be 
obtained from cells that express or contain the sequences, using standard techniques, 
such as phenol extraction and PCR of cDNA or genomic DNA. See, e.g., Sambrook 

25 et al., supra, for a description of techniques used to obtain and isolate DNA. Briefly, 
mRNA from a cell which expresses the gene of interest can be reverse transcribed 
with reverse transcriptase using oligo-dT or random primers. The single stranded 
cDNA may then be amplified by PCR (see U.S. Patent Nos. 4,683,202, 4,683,195 and 
4,800,159, see also PCR Technology: Principles and Applications for DNA 

30 Amplification, Erlich (ed.), Stockton Press, 1989)) using oligonucleotide primers 
complementary to sequences on either side of desired sequences. 
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The nucleotide sequence of interest can also be produced synthetically, rather 
than cloned, using a DNA synthesizer {e.g., an Applied Biosystems Model 392 DNA 
Synthesizer, available from ABI, Foster City, California). The nucleotide sequence 
can be designed with the appropriate codons for the expression product desired. The 
5 complete sequence is assembled from overlapping oligonucleotides prepared by 
standard methods and assembled into a complete coding sequence. See, e.g., Edge 
(1981) Nature 292:756; Nambair et al. (1984) Science 223:1299; Jay et al. (1984) J. 
Biol.Chem. 259:6311. 

10 2.4.0 Generation of Immune Response In Treated Subjects 

To evaluate efficacy, nucleic acid immunization using the polynucleotide 
component of the present invention (e.g., two expression cassettes each comprising a 
coding sequence for gpl40, wherein each coding sequence is derived from different 
HIV subtypes, serotypes, or strains) and antigenic immunization using the polypeptide 

1 5 component of the present invention (e.g., an oligomeric gp 1 40 wherein the coding 
sequence is derived from one of the HIV subtypes, serotypes, or strains represented in 
the polynucleotide component) can be performed, for example, as follows. 

Example 2 describes methods for the evaluation, in mice, of the 
immunogenicity of the compositions of the present invention used to induce immune 

20 response. The polynucleotide component described comprises two pCMVKM2 each 
carrying codon optimized coding sequences for gpl40 with delV2, the first coding 
sequence derived from SF162, subtype B, and the second coding sequence derived 
from TVl, subtype C. The mice are then immunized with oligomeric, codon 
optimized, gpl40 with delV2, derived from SF162, subtype B, polypeptide. Humoral 

25 and cellular immune responses are evaluated. The results of these assays are used to 
show the potency of the polynucleotide/polypeptide immunization methods of the 
present invention for the generation of an immxme response in mice. 

Example 3 describes in vivo immunization studies that may be carried out in a 
variety of laboratory animals (including, mice, guinea pigs, rabbits, rhesus macaques, 

30 and baboons). Results of these studies are used to demonstrate the usefiilness of the 
compositions and methods of the invention to generate immune responses, in 
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particular to generate broad and potent neutralizing activity against diverse HIV 
strains. 

Example 4 describes experiments performed in support of the present 
invention that evaluated immunogenicity regimens for various HIV polypeptide 
5 encoding plasmids used as primes and various HIV polypeptides used as boosts. In 
the example, the following vectors encoding gpUO proteins were employed: pCMV 
gpl40 dV2 SF162 DNA and pCMV gpl40 dV2 TVl DNA. These vectors comprise 
expression cassettes that encode gpl40 protein derived from two different HIV 
subtypes, subtype B (SF162) and subtype C (TVl). The V2 loop was deleted in both 

10 constructs and the coding sequences were codon optimized for expression in human 
cells. The specific gpl40 polynucleotides have been previously described (e.g., 
gpl40.modSF162.delV2, Figure 6, and gpl40.mut7.modSF162.delV2, Figure 7, see 
also, PCT International Publication No. WO/00/39302; and gpl40mod.TVl.delV2, 
Figure 8, and gpl40mod.TVl.mut7.delV2, Figure 9, see also PCT hitemational 

15 Publication No. WO/02/04493). The ability of the compositions and methods of the 
present invention to generate neutralizing antibodies was evaluated. The results of the 
assays for the presence of neutrahzing antibodies are presented in Figure 4 and Figure 
5. 

Figure 4 summarizes data showing the neutralization titers against HIV-1 
20 SF162 between seven experimental groups. These resuhs demonstrated that all groups 
showed strong neutralizing activity against the HIV-1 SF162 isolate. Further, 
neutralizing activity significantly increased at post 4* immunization compared to post 
3"^ immunizations. For the mixed (B+C) DNA prime and single protein boost, B 
protein gave a high boost to the mixed gene prime (B+C DNA + B prot), as did the C 
25 protein (B+C DNA + C prot). For the mixed DNA prime and protein boost, half dose 
(50ug) of protein (B+C DNA & prot (1/2)) induced neutralizing activity at least as 
well as the full dose of lOOug protein (B+C DNA & prot). 

Figure 5 summarizes data showing the neutralization titers against HIV-1 TVl 
(South African Subtype C) between seven experimental groups. These results 
30 demonstrated that all groups showed neutralizing activity against HIVl subtype C 

TVl isolate (as expected, because no Subtype C DNA or protein was used, the B DNA 
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+ B prot showed the lowest neutralizing activity). For the mismatched a single DNA 
prime and a single protein boost (C DNA + B prot), priming with C gene and boosting 
with B protein showed high titers, as did the B gene and B protein (B DNA + B prot). 
For the mixed (B+C) DNA prime and single protein boost, use of either B (B+C DNA 
5 + B prot) and C (B+C DNA + C prot) proteins had a similar boosting effect. 

Comparison of the data presented in Figure 4 and Figure 5 supports the 
combination methods of the present invention for generating an immune response in a 
subject, further, for generating neutralizing antibodies in inmiunized subjects. The 
data showed that the combination of DNA derived from different subtypes primed 

10 broad responses to multiple subtypes. This could be the result of the combined 
responses to subtype and/or sequence-specific continuous and/or discontinuous 
immunogenic epitopes as well as responses to the presentation of common conserved 
eptiopes in the oligomeric V2-deleted Env immunogens employed here. Furthermore, 
use of a single subtype protein was sufficient to boost broad neutralizing responses 

1 5 when immunity was primed with multiple subtypes of DNA. These results also 
demonstrated that use of lower doses of proteins mixture can also provide strong 
immune responses. 

These studies demonstrated the usefulness of the compositions (e.g., 
comprising a polynucleotide component and a polypeptide component) and methods 

20 of the invention to generate immune responses, in particular to generate broad and 
potent neutrahzing activity against diverse HIV subtypes and strains. It is readily 
apparent that the subject invention can be used to mount an immune response to a 
wide variety of antigens and hence to treat or prevent infection, particularly HIV 
infection. 

25 

3.0.0 Applications of the Present Invention to HIV 

While not desiring to be bound by any particular model, theory, or hypothesis, 
the following information is presented to provide a more complete understanding of 
the present invention. 

30 Protection against HTV infection will likely require potent and broadly reactive 

pre-existing neutralizing antibodies in vaccinated individuals exposed to a virus 
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challenge. Although cellular inmiune responses are desirable to control viremia in 
those who get infected, protection against infection has not been demonstrated for 
vaccine approaches that rely exclusively on the induction of these responses. For this 
reason, experiments performed in support of the present invention used combination 

5 prime-boost approaches that employ a polynucleotide component and a polypeptide 
component, wherein the polypeptide component encodes, for example, V-deleted 
envelope antigens from primary HIV isolates (e.g., R5 subtype B (HlV-lspiei) and 
subtype C (HIV-Itvi) strains), and the polypeptide component comprises at least one 
of these antigens. 

1 0 The polynucleotide component of the present invention may be delivered by 

enhanced DNA or RNA [polylactide co-glycolide (PLG) microparticle formulations or 
electroporation], adenovirus-based vectors, alphavirus replicons or replicon particles, 
polynucleotide or particle-based vaccine approaches. Efficient in vivo expression of 
plasmid encoded genes by electrical permeabilization has been described (see, e.g., 

1 5 ZuccheUi et al. (2000) / Virol. 74:1 1598-1 1607; Banga et al. (1998) Trends 

Biotechnol. 10:408-412; Heller et al. (1996) Febs Lett. 389:225-228; Mathiesen et al. 
(1999) Gene Ther. 4:508-514; Mir et al. (1999) Proc. Nat'l Acad Sci. USA 8:4262- 
4267; Nishi et al. (1996) Cancer Res: 5:1050-1055). The polypeptide component of 
the present invention may be administered, for example, by booster immunizations 

20 with Env proteins in MF59 or Iscomatrix adjuvant. 

All protein preparations were highly purified and extensively characterized by 
biophysical and immunochemical methodologies. Results from rabbit 
immunogenicity studies indicated that broad neutralizing antibody responses could be 
consistently induced against diverse HIV strains (Example 4). Moreover, using the 

25 combination prime-boost vaccine regimens, potent HIV antigen-specific CD4 + and 
CD8+ T-cell responses may also be generated. 

Although any HIV viral protein may also be employed in the practice of the 
present invention, in a preferred embodiment V1-, V2-, and/or V3-modified/deleted 
envelope DNA and corresponding polypeptides are good candidates for use in the 

30 compositions of the present invention. 
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One embodiment of this aspect of the present invention maybe described 
generally as follows. Antigens are selected for the vaccine composition(s). 
Polynucleotides encoding Env polypeptides and Env polypeptides are typically 
employed in a composition for generating an immunse response comprising a 
polynucleotide component and a polypeptide component. 

A nucleic acid prime is typically followed by at least one polypeptide boost. 
The boost may, for example, include adjuvanted HlV-derived polypeptides (e.g., 
analogous to those used for the DNA vaccinations), coding sequences for HIV-derived 
polypeptides (e.g., analogous to those used for the DNA vaccinations) encoded by a 
viral vector. Boosts may include further DNA vaccinations, and/or combinations of 
the foregoing. 

Further, different polypeptide antigens may be used in the boost relative to the 
initial vaccination and vice versa. In addition, the initial nucleic acid vaccination may 
be a viral vector comprising a DNA expression cassette construct. 

Some factors that may be considered in HIV envelope vaccine design are as 
follows. A fundamental criterion of an effective HIV vaccine is its ability to induce 
broad and potent neutralizing antibody responses against prevalent HIV strains. The 
important contribution of neutralizing antibodies in preventing the establishment of 
HIV, SIV and SHIV infection or delaying the onset of disease is highUghted by 
several studies. First, the emergence of neutralization-resistant viruses coincides or 
precedes the development of disease in infected animals (Bums (1993) J Virol. 
67:4104-13; Cheng-Mayer et al. (1999)7. Virol. 73:5294-5300; Narayan et al. (1999) 
Virology 256:54-63). Second, the pre-infusion of high concentrations of potent 
neutralizing monoclonal antibodies (mAbs) in the blood circulation of macaques, 
chimpanzees and SCID mice prior to their challenge with HIV, SIV or SHIV viruses, 
offers protection or delays the onset of disease (Conley et al. (1996) J. Virol. 70:6751- 
6758; Emini et al. (1992) Nature (London) 355:728-730; Gauduin et al. (1997) Nat 
Med. 3:1389-93; Mascola et al. {1999) J Virol. 73:4009-18; Mascola et al. (2000) 
Nature Med 6(2):207-210; Baba et al. (2000) Nature Med 6(2):200-206). Similarly, 
infusion of neutralizing antibodies collected from the serum of HIV- 1 -infected 
chimpanzees to naive pig-tailed macaques protects the latter animals from subsequent 
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viral challenge by SHIV viruses (Shibata et al (1999) Nature Medicine 5:204-210). 
Moreover, envelope-based vaccines have demonstrated protection against infection in 
non-human primate models. Vaccines that exclude Env-polypeptides generally confer 
less protective efficacy (see, e.g., Hu, S.L., et al., Recombinant subunit vaccines as an 
5 approach to study correlates of protection against primate lentivirus infection, 

hnmunol Lett. Jun;51(l-2):1 15-9 (1996); Amara, R.R., et al.. Critical role for Env as 
well as Gag-Pol in control of a simian-human immunodeficiency virus 89.6P 
challenge by a DNA prime/recombinant modified vaccinia virus Ankara vaccine, J 
Virol. Jun;76(12):6138-46 (2002)). 

1 0 Monomeric gp 1 20 protein-derived from the SF2 lab strain provided 

neutralization of HIV- 1 lab strains and protection against virus challenges in primate 
models (Verschoor, E.J., et al., (1999), "Comparison of immunity generated by 
nucleic acid, MF59 and ISCOM-formulated HIV-l gpl20 vaccines in rhesus 
macaques," J. Virology 73: 3292-3300). Primary gp 120 protein derived from Thai E 

15 field strains provided cross-subtype neutralization of lab strains (VanCott, T.C., et al., 
(1999) "Cross-subtype neutralizing antibodies induced in baboons by a subtype E 
gpl20 immunogen based on an R5 primary human immunodeficiency virus type 1 
envelope," J. Virology 73: 4640-4650). Primary subtype B oligomeric o-gpl40 
protein provided partial neutralization of subtype B primary (field) isolates (Bamett, 

20 S,W., et al. (2001) "The ability of an oUgomeric HfV-l envelope antigen to ehcit 
neutralizing antibodies against primary HfV-l isolates is improved following the 
partial deletion of the second hypervariable region," J. Virology, 75:5526-5540). 
Primary subtype B o-gpl40 delV2 DNA prime plus protein boost provided potent 
neutralization of diverse subtype B primary isolates and protection against virus 

25 challenge in primate models (Cherpelis, S., et al, (2000) "Vaccine-induced anti- 
envelope antibodies offer partial protection from SHIV infection to CD8+T-cell 
depleted rhesus macaques," J. Virology, 75, 1547-1550). 

Vaccine strategies for induction of potent, broadly reactive, neutralizing 
antibodies may be assisted by constinction of Envelope polypeptide stiiictures that 

30 expose conserved neutializing epitopes, for example, variable-region 

modifications/deletions and de-glycosylations, envelope protein-receptor complexes, 
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rational design based on crystal structure (e.g., beta-sheet deletions), and gp41 -fusion 
domain based immunogens. 

Stable CHO cell lines for envelope protein production have been developed 
using optimized envelope polypeptide coding sequences, including, but not limited to, 
5 the following: gpl20, o-gpl40, gpl20delV2, o-gpl40delV2, gpl20delVlV2, o- 
gpl40delVlV2. 

Exemplary antigenic compositions and immunogenicity studies in support of 
the compositions and methods of the present invention are presented in Example 4. 
In a first particular aspect of the present invention, the polynucleotide 

1 0 component of the present invention consists essentially of one polynucleotide 
encoding an HIV immunogenic polypeptide, and the polypeptide component 
comprises of one or more HIV immunogenic polypeptides analogous to the 
polypeptide encoded by said polynucleotide component, with the proviso that at least 
one HIV immunogenic polypeptide of the polypeptide component is derived from a 

1 5 different HIV subtype, serotype, or strain than the immunogenic polypeptide encoded 
by the polynucleotide component. In this context, the polynucleotide component 
consisting essentially of one polynucleotide encoding an HIV immunogenic 
polypeptide refers to the presence of one polynucleotide encoding one HIV 
immunogenic polypeptide in the composition. The polynucleotide composition may 

20 comprise further components in addition to the HIV immnunogenic polypeptide, such 
as immune enhancers, iramunoregulatory components, vector sequences (e.g., viral or 
non- viral), carriers, particles, excipients, expression control sequences, etc. In one 
embodiment of this aspect of the present invention, the HIV immunogenic 
polypeptide encoded by the polynucleotide component is derived from a subtype B 

25 strain, and at least one coding sequence of an HIV immunogenic polypeptide of the 
polypeptide component is derived from a subtype C strain. 

In a second particular aspect of the present invention, the polynucleotide 
component comprises two or more polynucleotide sequences comprising coding 
sequences for two or more analogous HIV immunogenic polypeptides, wherein the 

30 coding sequences for at least two of the HIV immunogenic polypeptides are derived 
from different HIV subtypes, serotypes, or strains, and a polypeptide component 
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comprising one or more HIV inmiunogenic polypeptides analogous to the polypeptide 
encoded by said polynucleotide component, with the proviso that, if the polypeptide 
component provides the same number or greater than the number of analogous HIV 
immunogenic polypeptides encoded by the polynucleotide component, then at least 

5 one of the HIV immunogenic polypeptides of the polypeptide composition is derived 
from a different HIV subtype, serotype, or strain than the HIV immunogenic 
polypeptides provided by the polynucleotide component. 

In one embodiment, the present invention includes a composition for use in 
generating an immune response in a subject, wherein the composition comprises a 

1 0 polynucleotide encoding an immunogenic HIV polypeptide and an analogous 

immunogenic HIV polypeptide from a different HIV subtype, serotype, or strain. The 
polynucleotide encoding an immunogenic HIV polypeptide is used for immunization 
via delivery of the polynucleotide (e.g., a prime), an analogous immunogenic HIV 
polypeptide derived from a different HIV subtype, serotype, or strain is used for 

1 5 immunization (e.g., a boost). For example, a DNA molecule is used for nucleic acid 
immunization, wherein the DNA molecule encodes an HIV envelope polypeptide (i) 
derived from an HIV Subtype C isolate, and (ii) that is codon optimized for expression 
in mammalian cells. This DNA immunization is followed by a protein boost using an 
HIV envelope polypeptide derived from an HIV Subtype B isolate. Exemplary 

20 envelope proteins include, but are not limited to, gpl20, gpUO, oligomenc gpl40, and 
gpl60, including mutated forms thereof (e.g., deletion of the V2 loop). One 
embodiment of this aspect of the present invention, comprises a composition for 
generating an immune response in a mammal, the composition comprising: a 
polynucleotide component having of a first polynucleotide encoding a first HIV 

25 immunogenic polypeptide; and a polypeptide component, comprising a second HIV 
immunogenic polypeptide, wherein the first and second immunogenic HIV 
polypeptide are derived from different HIV subtypes, serotypes, or strains, and (ii) the 
first and second immunogenic polypeptides encode analogous HIV polypeptides. 

A second embodiment the present invention includes a composition for use in 

30 generating an immune response in a subject, wherein the composition comprises a 
polynucleotide component comprising two or more polynucleotides encoding 
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immunogenic HIV polypeptides, derived from at least two different subtypes, 
serotypes, or strains, and a polypeptide component having a single, analogous, 
immunogenic HIV polypeptides derived from one of the subtypes, serotypes, or strains 
that is used for boosting. For example, two DNA molecules are used for nucleic acid 
5 immunization, wherein the first DNA molecule encodes an EJV envelope polypeptide 
(i) derived from an HIV Subtype C isolate, and (ii) that is codon optimized for 
expression in mammalian cells, and the second DNA molecule encodes an HIV 
envelope polypeptide (i) derived from an HIV Subtype B isolate, and (ii) that is codon 
optimized for expression in mammalian cells. This DNA immunization is followed by 

1 0 a protein boost using a single HIV envelope polypeptide (i) derived from an HIV 

Subtype B isolate or an HIV Subtype C isolate. Exemplary envelope proteins include, 
but are not limited to, gpl20, gpl40, oligomeric gpl40, and gpl60, including mutated 
forms thereof (e.g., deletion of the V2 loop). One embodiment of this aspect of the 
present invention comprises a composition for generating an immune response in a 

1 5 mammal, the composition comprising: a polynucleotide component comprising a first 
polynucleotide encoding a first immunogenic HIV polypeptide, and a second 
polynucleotide encoding a second immunogenic HIV polypeptide, wherein (i) the first 
and second immimogenic HIV polypeptide are derived from different HIV subtypes, 
and (ii) the first and second immunogenic polypeptides encode analogous HIV 

20 polypeptides, and a polypeptide component, having the first HIV immunogenic 

polypeptide, or the second HIV immunogenic polypeptide, with the proviso that the 
polypeptide component comprises at least one less HIV inmiunogenic polypeptide 
than is encoded by the polynucleotide component. 

The polynucleotide component may comprise further components as described 

25 herein (e.g., carriers, vector sequences, control sequences, etc.). The polypeptide 
component may comprise further components as described herein (e.g., carriers, 
adjuvants, immunoenhancers, etc.). 

In a third aspect, the present invention relates to the use of varied doses of 
polynucleotides and polypeptides in immimization methods (e.g., prime^oost 

30 methods), particularly the methods described herein. Thus, another aspect of the 
invention provides a method of generating an immime response in a subject 
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comprising administering a polynucleotide component consisting essentially of one 
polynucleotide encoding an HIV inmiunogenic polypeptide to a subject under 
conditions that are compatible with the expression of said polynucleotide in said 
subject for the production of the encoded HIV immunogenic polypeptide; and, 
5 administering a polypeptide component comprising one or more HIV immunogenic 
polypeptides analogous to the polypeptide encoded by said polynucleotide component, 
with the proviso that at least one HIV immunogenic polypeptide of the polypeptide 
component is derived from a different HFV subtype than the subtype from which the 
immunogenic polypeptide encoded by the polynucleotide component is derived. 

10 Another aspect of the present invention provides a method of generating an 

irmnime response in a subject comprising administering a polynucleotide component 
comprising two or more polynucleotides comprising coding sequences for two or more 
analogous HIV immunogenic polypeptides, wherein the coding sequences for at least 
two of the HIV immunogenic polypeptides are derived from different HIV subtypes, 

15 to a subject under conditions that are compatible with the expression of said 

polynucleotides in said subject for the production of the encoded HIV immunogenic 
polypeptides; and, administering a polypeptide component that comprises one or 
more HTV immimogenic polypeptides analogous to the polypeptide encoded by said 
polynucleotide component, with the proviso that if the polypeptide component 

20 comprises the same number or greater than the number of analogous HIV 

immunogenic polypeptides encoded by the polynucleotide component, then at least 
one of the HIV immunogenic polypeptides of the polypeptide composition is derived 
from a different HIV subtype than the HIV immunogenic polypeptides provided by 
the polynucleotide component. 

25 In a further aspect, the invention provides a method of generating an immune 

response in a subject comprising 

providing a composition comprising a polynucleotide component consisting 
essentially of one polynucleotide encoding an HIV immunogenic polypeptide, and 
a polypeptide component comprising one or more HIV immunogenic 

30 polypeptides analogous to the polypeptide encoded by said polynucleotide component, 
with the proviso that at least one HTV immunogenic polypeptide of the polypeptide 
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component is derived from a different HIV subtype than the subtype from which the 
immunogenic polypeptide encoded by the polynucleotide component is derived: 

administering a gene delivery vector comprising the polynucleotide of said 
polynucleotide component of the composition into said subject under conditions that 
5 are compatible with expression of said polynucleotide in said subject for the 
production of encoded HIV immunogenic polypeptides; and 

administering the polypeptide component to said subject. 
Yet another aspect of the invention provides a method of generating an 
inmiune response in a subject comprising 

1 0 providing a composition comprising a polynucleotide component comprising 

two or more polynucleotide sequences comprising coding sequences for two or more 
analogous HIV immunogenic polypeptides, wherein the coding sequences for at least 
two of the HIV immunogenic polypeptides are derived from different HIV subtypes, 
and a polypeptide component comprising one or more HIV immunogenic polypeptides 

1 5 analogous to the polypeptide encoded by said polynucleotide component, with the 
proviso that if the polypeptide component comprises the same nimiber or greater than 
the number of analogous HIV immunogenic polypeptides encoded by the 
polynucleotide component, then at least one of the HIV immunogenic polypeptides of 
the polypeptide composition is derived from a different HIV subtype than the HIV 

20 immunogenic polypeptides provided by the polynucleotide component; 

administering one or more gene delivery vectors comprising the 
polynucleotides of said polynucleotide component of the composition into said subject 
under conditions that are compatible with expression of said polynucleotides in said 
subject for the production of encoded HIV immunogenic polypeptides; and 

25 administering the polypeptide component to said subject. 

In any immunization method using, for example, a mixed polynucleotide prime 
(i.e., two or more polynucleotides encoding immunogenic HIV polypeptides derived 
from two or more; HIV subtypes, serotypes, or strains) in conjunction with a 
polypeptide boost the present invention includes using reduced doses of each single 

30 component to provide an equivalent immune response to using full doses of each 

component. In one embodiment, the high threshold of DNA is the maximum tolerable 
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dose of DNA (e.g., about 5 mg to about 10 mg total DNA), the low threshold of DNA 
is the minimum effective dose (e.g., about 2 ug to about 10 ug total DNA), the high 
threshold of protein is the maximum tolerable dose of protein (e.g., about 1 mg total 
protein), the low threshold of protein is the minimum effective dose (e.g., about 2 ug 
5 total protein). Experiments performed in support of the present invention 

demonstrated the efficacy of dividing the total DNA dose among the polynucleotides 
of the polynucleotide component (e.g., Example 4). Further, experiments performed 
in support of the present invention (e.g., Example 4) demonstrated the efficacy of 
dividing the total polypeptide dose among the polypeptides comprising the 

10 polypeptide component. The total DNA and total protein are both typically above the 
low threshold values. 

In a preferred embodiment, the total amount of DNA in a given DNA 
immunization has a high threshold of less than or equal to about 10 mg total DNA and 
greater than or equal to 1 mg total DNA, and the total amount of protein in a given 

1 5 polypeptide boost has a high threshold of less than or equal to about 200 ug total 

protein product and greater than or equal to 10 ug of total protein. For example, in an 
embodiment using a polynucleotide component having two DNA molecules each 
encoding an immunogenic HIV polypeptide the dose of each DNA molecule per 
subject may be one milligram of each DNA molecule encoding an immunogenic HIV 

20 polypeptide, for a total of 2 mg for the two DNA molecules, or 0.5 mg of each DNA 
molecule encoding an immunogenic HIV polypeptide, for a total of 1 mg for the two 
DNA molecules. Dosing with the polypeptide component may be similarly varied, for 
example, using a polypeptide component having two immunogenic HIV polypeptides 
the dose of each polypeptide per subject may be 100 micrograms of each 

25 immunogenic HIV polypeptide, for a total of 200 ug for the two polypeptides, 50 
micrograms of each iimnunogenic HIV polypeptide, for a total of 100 ug for the two 
polypeptides, or 25 ug of each immunogenic HIV polypeptide, for a total of 50 ug for 
the two polypeptides. As described above, more than two polypeptides may be 
included in the polypeptide component of the present invention. 

30 In further embodiments, the polynucleotide component of the present invention 

may comprise one or more gene delivery vectors comprising the polynucleotide(s) 
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encoding immunogenic HIV po]ypeptide(s). The polypeptide component of the 
present invention may comprise an adjuvant in addition to the immunogenic 
polypeptide(s). The present invention also comprises a method for generating an 
immune response in a subject, the method comprising, administering the 
5 polynucleotide composition to the subject under conditions that are compatible with 
expression of the polynucleotide(s) encoding immunogenic HIV polypeptide(s) in the 
subject, and administering the polypeptide composition to the subject. The 
administering of polynucleotide and polypeptide compositions may be concurrent or 
sequentially. In a preferred embodiment immunization with a polynucleotide 
10 component precedes immunization with at least one polypeptide component. Further, 
a single prime may be followed by multiple boosts or a series of primes and boosts 
may be used. 

Exemplary envelope proteins, and coding sequences thereof, for use in the 
present invention include, but are not limited to, gpl20, gpl40, oligomeric gpl40, and 

15 gp 1 60, including mutated or modified forms thereof (e.g., deletion of the V2 loop, 
mutations in cleavage sites, or mutations in glycosylation sites). In one embodiment, 
HTV envelope polypeptides that have been modified to expose the region of their 
polypeptide that binds to the CCR5 receptor are useful in the practice of the present 
invention, as well as polynucleotide sequences encoding such polypeptides. From the 

20 perspective of humoral immunity, it is useful to generate an immune response that 
provides neutralization of primary isolates that utihze the CCR5 chemokine co- 
receptor, which is believed to be important for virus entry (Zhu, T., et al. (1993) 
Science 261:1 179-1181; Fiore, J., et al. (1994) Virology; 204:297-303). These and 
other exemplary polynucleotide constructs (e.g.^ a variety of envelope protein coding 

25 sequences), methods of making the polynucleotide constructs, corresponding 
polypeptide products, and methods of making polypeptides useful for HIV 
immunization have been previously described, for example, in the following: PCT 
International Publication Nos.: WO/00/39302; WO/00/39304; WO/02/04493; 
WO/03/004657; WO/03/004620; and WO/03/020876; US Patent No. 6,602,705; and 

30 US Published Patent Application Nos. 20030143248 , and 20020146683. 
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Although described with reference to HIV subtypes B and C as exemplary 
subtypes, the compositions and methods of the present invention are applicable to a 
wide variety of HIV subtypes, serotypes, or strains and immunogenic polypeptides 
encoded thereby, including but not limited to the following: HTV-l subtypes, A 
5 through K, N and 0, the identified CRFs (circulating recombinant forms), and HIV-2 
strains and its subtypes. See, e.g., Myers, et al., Los Alamos Database, Los Alamos 
National Laboratory, Los Alamos, New Mexico; Myers, et al., Human Retroviruses 
and Aids, 1990, Los Alamos, New Mexico: Los Alamos National Laboratory. 
Further modifications of Env include, but are not limited to, generating 

10 polynucleotides that encode Env polypeptides having mutations and/or deletions 

therein. For instance, some or all of hypervariable regions, VI, V2, V3, V4 and/or V5 
can be deleted or modified as described herein, particularly regions VI , V2, and V3 . 
VI and V2 regions may mask CCR5 co-receptor binding sites. (See, e.g., Moulard, et 
al. (2002) Proc. Nat'l Acad. Sci 14:9405-9416). Accordingly, in certain embodiments, 

15 some or all of the variable loop regions are deleted, for example to expose potentially 
conserved neutraUzing epitopes. Fiuther, deglycosylation of N-linked sites are also 
potential targets for modification inasmuch as a high degree of glycosylation also 
serves to shield potential neutralizing epitopes on the surface of the protein. 
Additional optional modifications, used alone or in combination with variable region 

20 deletes and/or deglycosylation modification, include modifications (e.g., deletions) to 
the beta-sheet regions (e.g., as described in WO 00/39303), modifications of the leader 
sequence (e.g., addition of Kozak sequences and/or replacing the modified wild type 
leader with a native or sequence-modified tpa leader sequence) and/or modifications to 
protease cleavage sites (e.g., Chakrabarti, et al., (2002)7. Virol. 76(ll):5357-5368 

25 describing a gpl40 Delta CFI containing deletions in the cleavage site, fusogenic 
domain of gp41, and spacing of heptad repeats 1 and 2 of gp41 that retained native 
antigenic conformational determinants as defined by binding to known monoclonal 
antibodies or CD4, oligomer formation, and virus neutralization in vitro). 

Various combinations of these modifications can be employed to generate 

30 wild-type or synthetic polynucleotide sequences as described herein. 
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Modification of the Env polypeptide coding sequences may result in (1) 
improved expression relative to the wild-type coding sequences in a number of 
mammalian cell lines (as well as other types of cell lines, including, but not limited to, 
insect cells), and/or (2) improved presentation of neutralizing epitopes. Similar Env 
5 polypeptide coding sequences can be obtained, modified and tested for improved 
expression from a variety of isolates. 

Any of the polynucleotides (e.g., expression cassettes) or polypeptides 
described herein (delivered by any of the methods described above) can also be used 
in combination with other DNA delivery systems and/or protein delivery systems. 

1 0 Non-limiting examples include co-administration of these molecules, for example, in 
prime-boost methods where one or more molecules are delivered in a "priming" step 
and, subsequently, one or more molecules are delivered in a "boosting" step. In 
certain embodiments, the delivery of one or more nucleic acid-containing 
compositions is followed by delivery of one or more nucleic acid-containing 

1 5 compositions and one or more polypeptide-containing compositions (e.g., 

polypeptides comprismg HIV antigens). In other embodiments, multiple nucleic acid 
"primes" (of the same or different nucleic acid molecules) can be followed by multiple 
polypeptide "boosts" (of the same or different polypeptides). Other examples include 
multiple nucleic acid administrations and multiple polypeptide administrations. 

20 In any method involving co-administration, the various compositions can be 

delivered in any order. Thus, in embodiments including delivery of multiple different 
compositions or molecules, the nucleic acids need not be all delivered before the 
polypeptides. For example, the priming step may include delivery of one or more 
polypeptides and the boosting comprises delivery of one or more nucleic acids and/or 

25 one more polypeptides. Multiple polypeptide administrations can be followed by 
multiple nucleic acid administrations or polypeptide and nucleic acid administrations 
can be performed in any order. Thus, one or more or the nucleic acid molecules (e.g., 
expression cassettes) described herein and one or more of the polypeptides described 
herein can be co-administered in any order and via any administration routes. 

3 0 Therefore, any combination of polynucleotides and polypeptides described herein can 
be used to elicit an immune reaction. 
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In addition, following prime-boost regimes (such as those of the present 
invention described herein) may be beneficial to help reduce viral load in infected 
subjects, as well as possibly slow or prevent progression of HlV-related disease 
(relative to untreated subjects). 

Experimental 

Below are examples of specific embodiments for carrying out the present 
invention. The examples are offered for illustrative purposes only, and are not 
intended to limit the scope of the present invention in any way. 

Efforts have been made to ensure accuracy with respect to numbers used (e.g., 
amounts, temperatures, etc.), but some experimental error and deviation should, of 
course, be allowed for. 

Example 1 

Generation of Synthetic Expression Cassettes 
A. Generating Synthetic Polynucleotides 

The polynucleotide sequences used in the practice of the present invention are 
typically manipulated to maximize expression of their gene products in a desired host 
or host cell. Following here is some exemplary guidance concerning codon 
optimization and functional varients of HIV polypeptides. The order of the following 
steps may vary. 

First, the HIV-1 codon usage pattern may be modified so that the resulting 
nucleic acid coding sequence is comparable to codon usage found in highly expressed 
human genes. The HIV codon usage reflects a high content of the nucleotides A or T 
of the codon-triplet. The effect of the HIV-1 codon usage is a high AT content in the 
DNA sequence that results in a high AU content in the RNA and in a decreased 
translation ability and instability of the mRNA. In comparison, highly expressed 
human codons prefer the nucleotides G or C. Wild-type polynucleotide sequences 
encoding polypeptides are typically modified to be comparable to codon usage found 
in highly expressed human genes. 
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Second, for some genes variants are created (e.g., mutated forms of the wild- 
type polypeptide). In the following table (Table 2) mutations affecting the activity of 
several HIV genes are disclosed. 



Table 2 



Gene 


"Region" 


Exemplary Mutations 


Pol 


prot 


Att = Reduced activity by attenuation of Protease 
(Thr26Ser) (e.g., Konvalinka et al., 1995, J Virol 69: 

7180-86) 

Ina = Mutated Protease, nonfunctional enzyme 
(Asp25Ala)(e.g., Konvalinka et al., 1995, J Virol 69: 
7180-86) 




RT 


YM = Deletion of catalytic center (YMDD_AP; SEQ ID 
N0:7) (e.g., Biochemistry, 1995, 34, 5351, Patel et. al.) 
WM = Deletion of primer grip region (WMGY_PI; SEQ 
ID N0:8) (e.g., J Biol Chem, 272, 17, 1 1 157, 
Palaniappan, et. al., 1997) 




RNase 


no direct mutations, RnaseH is affected by "WM" 
mutation in RT 




Integrase 


1) Mutation of HHCC domain, Cys40Ala (e.g., 
Wiskerchen et. al., 1995, J Virol, 69: 376). 
2.) Inactivation catalytic center, Asp64Ala, Aspl 16Ala, 
Glul52Ala (e.g., Wiskerchen et. al., 1995, J Virol, 69: 

376). 

3) Inactivation of minimal DNA binding domain 
(MDBD), deletion of Trp235(e.g., Ishikawa et. al., 1999, 
J Virol, 73: 4475). 

Constructs int.opt.mut.SF2 and int.opt.mut_C (South 
Africa TVl) both contain all these mutations (1, 2, and 
3) 
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Gene "Region" 


Exemplary Mutations 


Env 


Mutations in cleavage site (e.g., Earl et al. (1990) PNAS 
USA 87:648-652; Earl et al. (1991) J. Virol. 65:31-41). 

Mutations in glycosylation site (e.g., GM mutants, for 
example, change Q residue in VI and/or V2 to N 
residue; may also be designated by residue altered in 
sequence) 

Deletions or modifications of the VI, V2, V3, V4 or V5 
regions or combinations thereof (See e.g., US 6602705) 

Deletions or modifications of the |3-sheets regions. (See 
e.g., WO 00/39303) 


Tat 


Mutants of Tat in transactivation domain (e.g., Caputo et 
al., 1996, Gene Ther. 3:235), e.g., cys22 mutant 
(Cys22Gly), cys37 mutant (Cys37Ser), and double 
mutants 


Rev 


Mutations in Rev domains (e.g., Thomas et al., 1998, J 

Virol. 72:2935-44), e.g., mutation in RNA binding- 
nuclear localization ArgArg38,39AspLeu, mutations 
in activation domain LeuGlu78,79AspLeu = MIO 


Nef 


Mutations of myristoylation signal and in 
oligomerization domain, for example: 

1. Single point mutation myristoylation signal: 
Gly-to-Ala 

2. Deletion of N-terminal first 18 (subtype B, e.g., 
SF162) or 19 (subtype C, e.g.. South Africa clones) 
amino acids. 

(e.g., Peng and Robert-Guroff, 2001, Immunol Letters 
78: 195-200) 

Single point mutation ohgomerization: 
(e.g., Liu et al., 2000, J Virol 74: 5310-19) 

Mutations affecting (1) infectivity (replication) of HIV- 
virions and/or (2) CD4 down regulation. (e.g., 
Lundquist et al. {im) J Virol. 76(9):4625-33) 


Vif 


Mutations of Vif: 

e.g., Simon et al., 1999, J Virol 73:2675-81 
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Gene "Region" 


Exemplary Mutations 


Vpr 


Mutations of Vpr: 

e.g., oingn ei di., zuwu, j v iiui /t. iuujv-j / 


Vpu 


Mutations of Vpu: 

e.g., Tiganos et al, 1998, Virology 251: 96-107 



Exemplary polynucleotides comprising some of these mutations have been 
previously described ( see, e.g., PCI International Publication Nos.: WO/00/39302; 
WO/00/39303; WO/00/39304; WO/02/04493; WO/03/004657; WO/03/004620; and 

5 WO/03/020876). Reducing or eliminating the function of the associated gene 

products can be accomplished employing the teachings set forth in the above table, in 
view of the teachings of the present specification. 

hi one aspect, the present invention comprises Env coding sequences that 
include, but are not limited to, polynucleotide sequences encoding the following HIV- 

10 encoded polypeptides: gpl60, gpl40, and gpl20 (see, e.g., U.S. Patent No. 5,792,459 
for a description of the HIV-1sf2 ("SF2") Env polypeptide). The relationships 
between these polypeptides is shown schematically in Figure 3 (in the figure: the 
polypeptides are indicated as lines, the amino and carboxy termini are indicated on the 
gpl60 line; the open circle represents the oligomerization domain; the open square 

1 5 represents a transmembrane spanning domain (TM); and "c" represents the location of 
a cleavage site, in gpl40.mut the "X" indicates that the cleavage site has been mutated 
such that it no longer functions as a cleavage site). The polypeptide gpl60 includes 
the coding sequences for gpl20 and gp41. The polypeptide gp41 is comprised of 
several domains including an oligomerization domain (OD) and a transmembrane 

20 spanning domain (TM). In the native envelope, the oligomerization domain is 

required for the non-covalent association of three gp41 polypeptides to form a trimeric 
structure: through non-covalent interactions with the gp41 trimer (and itself), the 
gpl20 polypeptides are also organized in a trimeric structxu-e. A cleavage site (or 
cleavage sites) exists approximately between the polypeptide sequences for gpl20 and 

25 the polypeptide sequences corresponding to gp41 . This cleavage site(s) can be mutated 
to prevent cleavage at the site. The resulting gpl40 polypeptide corresponds to a 
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truncated form of gpl60 where the transmembrane spanning domain of gp41 has been 
deleted. This gpl40 polypeptide can exist in both monomeric and ohgomeric (i.e. 
trimeric) forms by virtue of the presence of the ohgomerization domain in the gp41 
moiety. In the situation where the cleavage site has been mutated to prevent cleavage 

5 and the transmembrane portion of gp4 1 has been deleted the resulting polypeptide 
product is designated "mutated" gpl40 (e.g., gpHO.mut). As will be apparent to those 
in the field, the cleavage site can be mutated in a variety of ways, (See, also, e.g., PCT 
International Publication Nos. WO 00/39302 and WO/02/04493). 

Wild-type HIV coding sequences (e.g., Gag, Env, Pol, tat, rev, nef, vpr, vpu, 

1 0 vif, etc.) can be selected from any known HIV isolate and these sequences 

manipulated to maximize expression of their gene products following the teachings of 
the present invention. The wild-type coding region maybe modified in one or more of 
the following ways: sequences encoding hypervariable regions of Env, particularly VI 
and/or V2 are deleted, and/or mutations are introduced into sequences, for example, 

1 5 encoding the cleavage site in Env to abrogate the enzymatic cleavage of oligomeric 
gpl40 into gpl20 monomers. (See, e.g.. Earl et al. (1990) PNAS USA 87:648-652; 
Earletal. (1991) J. Virol. 65:31-41). hi yet other embodiments, hypervariable 
region(s) are deleted, N-glycosylation sites are removed and/or cleavage sites mutated. 
As discussed above, different mutations maybe introduced into the coding sequences 

20 of different genes (see, e.g., Table 2). 

To create the synthetic coding sequences of the present invention the gene 
cassettes are designed to comprise the entire coding sequence of interest. Synthetic 
gene cassettes are constructed by oUgonucleotide synthesis and PGR amplification to 
generate gene fragments. Primers are chosen to provide convenient restriction sites 

25 for subcloning. The resulting fragments are then ligated to create the entire desired 
sequence which is then cloned into an appropriate vector. The final synthetic 
sequences are (i) screened by restriction endonuclease digestion and analysis,(ii) 
subjected to DNA sequencing in order to confirm that the desired sequence has been 
obtained and (iii) the identity and integrity of the expressed protein confirmed by 

30 SDS-PAGE and Western blotting. The synthetic coding sequences are assembled at 
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Chiron Corp. (Emeryville, CA) or by the Midland Certified Reagent Company 
(Midland, Texas). 

Percent identity to the synthetic sequences of the present invention can be 
determined, for example, using the Smith- Waterman search algorithm (Time Logic, 
Incline Village, NV), with the following exemplary parameters: weight matrix = 
nuc4x4hb; gap opening penalty = 20, gap extension penalty = 5, reporting threshold = 
1; alignment threshold = 20. 

Various forms of the different embodiments of the present invention {e.g., 
constructs) may be combined. 

Some exemplary embodiments of synthetic polynucleotides useful in the 
practice of the present invention are discussed in Example 4 and presented in Figure 6 
to Figure 19. 

R Creating Expression Cassettes Comprising the Synthetic Polynuc leotides of the 
Present hivention 

The synthetic DNA fragments of the present invention may be cloned into a 
number of viral or non-viral expression vectors. For example, polynucleotides used in 
the practice of the present invention may be cloned into the foUov^dng non-viral 
expression vectors: (i) pCMVKm2, for transient expression assays and DNA 
immunization studies, the pCMVKm2 vector was derived from pCMV6a (Chapman et 
al., Nuc. Acids Res. (1991) 19:3979-3986) and comprises a kanamycin selectable 
marker, a ColEl origin of replication, a CMV promoter enhancer and Intron A, 
followed by an insertion site for the synthetic sequences described below followed by 
a polyadenylation signal derived from bovine growth hormone ~ the pCMVKm2 
vector differs from the pCMV-link vector only in that a polylinker site was inserted 
into pCMVKm2 to generate pCMV-link; (ii) pESN2dhfr and pCMVPLEdhfr (also 
known as pCMVIII), for expression in Chinese Hamster Ovary (CHO) cells; and, (iii) 
pAcC13, a shuttle vector for use in the Baculovirus expression system (pAcC13, was 
derived from pAcC12 which was described by Munemitsu S., et al., Mol Cell Biol. 
10(1 1):5977-5982, 1990). See, also PCT hitemational Publication Nos. WO 
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00/39302, WO 00/39303, WO 00/39304, WO 02/04493 for a description of these 
vectors. 

Briefly, construction of pCMVPLEdhfr (pCMVIII) was as follows. To 
construct a DHFR cassette, the EMCV IRES (internal ribosome entry site) leader was 
5 PCR-amplified from pCite-4a+ (Novagen, Inc., Milwaukee, WI) and inserted into 
pET-23d (Novagen, Inc., Milwaukee, WI) as mXba-Nco fragment to give pET- 
EMCV. The dhfr gene was PCR-amplified from pESN2dhfr to give a product with a 
Gly-Gly-Gly-Ser spacer in place of the translation stop codon and inserted as an Nco- 
BaniRX fragment to give pET-E-DHFR. Next, the attenuated neo gene was PGR 

10 amplified from a pSV2Neo (Clontech, Palo Alto, CA) derivative and inserted into the 
unique Bamm site of pET-E-DHFR to give pET-E-DHFR/NeO(n^). Then, the bovine 
growth hormone terminator from pCDNA3 (hivifrogen, Inc., Carlsbad, CA) was 
inserted downstream of the neo gene to give pET-E-DHFR/Neo(m2)BGHt. The 
EMCW-dhfr/neo selectable marker cassette fragment was prepared by cleavage of 

1 5 pET-E-DHFR/NeO(m2)BGHt. The CMV enhancer/promoter plus Intron A was 

transferred from pCMV6a (Chapman et al., Nuc. Acids Res. (1991) 19:3979-3986) as a 
Hindm-Sall fragment into pUC19 (New England Biolabs, Inc., Beveriy, MA). The 
vector backbone of pUC19 was deleted from the Ndel to the Sapl sites. The above 
described DHFR cassette was added to the construct such that the EMCV IRES 

20 followed the CMV promoter to produce the final construct. The vector also contained 
an amp"^ gene and an SV40 origin of replication. 

Expression vectors of the present invention may comprise one or more 
polynucleotide sequence encoding immunogenic polypeptides. When the expression 
cassette contains more than one coding sequence the coding sequences may all be in- 

25 frame to generate one polyprotein; alternatively, the more than one polypeptide coding 
sequences may comprise a polycistronic message where, for example, an IRES is 
placed 5' to each polypeptide coding sequence; fiirther, multiple promoters may be 
present to direct the expression of multiple coding sequences. 
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Example 2 

In Vivo Immunogenicitv in Mice of Synthetic HIV Expression Cassettes and 
Polypeptides Encoded Thereby 

5 A. Immunization 

To evaluate the immunogenicity of the compositions of the present invention 
used to induce immune response, a mouse study may be performed. The 
polynucleotide component (e.g., two pCMVlink-based plasmids each carrying codon 
optimized coding sequences for gpl40 with delV2, the first coding sequence derived 

1 0 from SF 1 62, subtype B, and the second coding sequence derived from TV 1 , subtype 
C), is diluted in a total injection volume of 100 \i\ using varying doses of DNA (0.02 - 
200^g). To overcome possible negative dilution effects of the diluted DNA, the total 
DNA concentration in each sample can be adjusted using the vector (e.g., pCMVlink) 
alone. Groups of 10-12 Balb/c mice (Charles River, Boston, MA) are intramuscularly 

15 immunized (50 ^1 per leg, intramuscular injection into the tibialis anterior) using 
varying dosages. 

The mice are then immunized with oligomeric, codon optimized, gpl40 with 
delV2, derived from SF162, subtype B, polypeptide at intervals following the DNA 
immunization using appropriate concentrations of polypeptide. 

20 

R Humoral Immune Response 

The humoral immune response is checked with a suitable anti-HIV antibody 
ELISAs (enzyme-linked intmiunosorbent assays) of the mice sera 0 and 2-4 week 
intervals post immunization. 

25 The antibody titers of the sera are determined by anti-HIV antibody ELISA. 

Briefly, sera from immunized mice are screened for antibodies directed against HIV 
envelope protein. ELISA microtiter plates are coated with 0.2 ^g of HIV envelope 
gpl40 protein per well overnight and washed four times; subsequently, blocking is 
done with PBS-0.2% Tween (Sigma) for 2 hours. After removal of the blocking 

30 solution, 100 ^il of diluted mouse serum is added. Sera are tested at 1/25 dilutions and 
by serial 3-fold dilutions, thereafter. Microtiter plates are washed four times and 
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incubated with a secondary, peroxidase-coupled anti-mouse IgG antibody (Pierce, 
Rockford, IL). ELISA plates are washed and 100 ^1 of 3, 3', 5, 5'-tetramethyl 
benzidine (TMB; Pierce) is added per well. The optical density of each well is 
measured after 15 minutes. The titers reported are the reciprocal of the dilution of 
5 serum that gave a half-maximum optical density (O.D.). 

The results of these assays are used to show the potency of the 
polynucleotide/polypeptide immunization methods of the present invention for the 
generation of an immune response in mice. 

10 C. Cellular Immune Response 

The frequency of specific cytotoxic T-lymphocytes (CTL) is evaluated by a 
standard chromium release assay of peptide pulsed Balb/c mouse CD4 cells. HIV 
protein-expressing vaccinia virus infected CD-8 ceils may be used as a positive 
control (w-protein). Briefly, spleen ceils (Effector cells, E) are obtained from the 

1 5 B ALB/c mice (immunized as described above). The cells are cultured, restimulated, 
and assayed for CTL activity against, e.g.. Envelope peptide-pulsed target cells (see, 
e.g.. Doe, B., and Walker, CM., AIDS 10(7):793-794, 1996, for a general description 
of the assay). Cytotoxic activity is measured in a standard ^^Cr release assay. Target 
(T) cells are cultured with effector (E) cells at various E:T ratios for 4 hours and the 

20 average cpm from duplicate wells is used to calculate percent specific ^^Cr release. 
Antigen specific T cell responses in immunized mice can also be measured by flow 
cytometric determinations of infracellular cytokine production (Cytokine flow 
Cytometry or "CFC") as described by zur Megede, J., et al.„ in Expression and 
immunogenicity of sequence-modified human immunodeficiency virus type 1 subtype 

25 B pol and gagpol DNA vaccines, J Virol. 77:6197-207 (2003). 

Cytotoxic T-cell (CTL) or CFC activity is measured in splenocytes recovered 
from the mice immunized with HIV DNA constructs and polypeptides as described 
herein. Effector cells from the immunized animals typically exhibit specific lysis of 
HIV peptide-pulsed SV-BALB (MHC matched) targets cells indicative of a CTL 

30 response. Target cells that are peptide-pulsed and derived from an MHC-unmatched 
mouse strain (MC57) are not lysed. The results of the CTL or CFC assays are used to 
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show the potency of the polynucleotide/polypeptide immunization methods of the 
present invention for induction of cytotoxic T-lymphocyte (CTL) responses by DNA 
immunization. 

5 Example 3 

In Vivo Immunogenicity Studies 
A. General Immunization Methods 

To evaluate the immune response generated using the compositions 
(comprising a polynucleotide component and a polypeptide component) and methods 

10 of the present invention, studies using guinea pigs, rabbits, mice, rhesus macaques 
and/or baboons may be performed. The studies are typically structured as shown in 
the following table (Table 3) and can be carried out using, for example, the following 
components: Subtype B DNA - pCMVlink carrying a codon optimized coding 
sequences for gpl40 with delV2, the coding sequence derived from SF162, subtype B; 

1 5 Subtype C DNA ~ pCMVlink carrying a codon optimized coding sequences for gpl40 
with delV2, the coding sequence derived from TVl, subtype C; Subtype B protein ~ 
oligomeric, codon optimized, gpl40 with delV2, derived from SF162, subtype B 
polypeptide; and Subtype C protein ~ oligomeric, codon optimized, gpl40 with 
delV2, derived from TVl, subtype C polypeptide. 

20 Tables 



DNA 
Immunization 


Protein Immunization 


Subtype 
B 


Subtype 

C 


Subtype B & C 
(IX) 


Subtype B & C 
(2X) 


Subtype B 


X 


X 


X 


X 


Subtype C 


X 


X 


X 


X 


Subtype B & C 
(IX) 


X 


X 


X 


X 


Subtype B & C 
(2X) 


X 


X 


X 


X 



The immunizations may use single or multiple DNA immunizations and single 
or multiple protein immimizations. The immunizations in the above table exemplify 
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the following methods: prime/boost regimens (Subtype B DNA/Subtype B protein; 
Subtype C DNA/Subtype C protein); mixed prime/boost, single DNA prime and single 
-protein boost (Subtype B DNA/Subtype C protein; Subtype C DNA/Subtype B 
protein); mixed DNA prime single protein boost (Subtype B & C DNA/Subtype B 

5 protein; Subtype B & C DNA/Subtype C protein); single DNA prime mixed protein 
boost (Subtype B DNA/Subtype B & C protein; Subtype C DNA/Subtype B & C 
protein); and mixed DNA prime mixed protein boost (Subtype B& C DNA/Subtype B 
& C protein. The amount of each DNA and /or protein in the mixed samples (i.e, B <& 
C, in this example) can be added at an amount equal to that delivered in the single 

10 imuunizations (such that 2X the amount of total DNA and/or protein is delivered) or 
the amount of each DNA and/or protein in the mixed samples can be adjusted so that 
the same total amount (IX) of DNA and/or protein is delivered in the mixed and 
single samples. 

hi addition to examples in Table 3 exemplifying combinations of 
1 5 polynucleotide component and polypeptide component, other combinations 

exemplifying two polynucleotide or two polypeptide components can be mentioned. 
For example, continuing the above example using combinations of HIV subtype B and 
subtype C immunogens, the present invention also includes single DNA prime and 
single DNA boost (Subtype B DN/VSubtype C DNA); single protein prime and single 
20 protein boost (Subtype B protein/Subtype C protein). 

B. Mice 

Experiments may be performed in mice following the immunization protocol 
illustrated in Table 3 and using the methods essentially as described in Example 2. 

25 

C. Guinea Pigs 

Experiments may be performed in guinea pigs as follows. Groups comprising 
six guinea pigs each are immunized parenterally (e.g., intramuscularly or 
intradermally) or mucosally at 0, 4, and 12 weeks with plasmid DNAs comprising 
30 expression cassettes comprising one or more HIV immunogenic polypeptide (for 

example, gp 140 DNAs as descibed in Example 2) as illustrated in Table 3. A subset 
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of the animals are subsequently boosted at approximately 12-24 weeks with a single 
dose (intramuscular, intradermally or mucosally) of the HIV protein(s) (for example, 
gp 140 DNAs as descibed in Example 2) as illustrated in Table 3. Animals may be 
boosted subsequently multiple times at 8-16 week intervals with the HIV protein. 
5 Antibody titers (geometric mean titers) are measured at two weeks following the third 
DNA immunization and at two weeks after the protein boost. Results of these studies 
are used to demonstrate the usefulness of the compositions and methods of the 
invention to generate immune responses, in particular to generate broad and potent 
neutralizing activity against diverse HIV strains. 

10 

D. Rabbits 

Experiments may be performed in rabbits as follows. Rabbits are immunized 
intramuscularly or intradermally at multiple sites (using needle injection with or 
without subsequent electroporation, or using a Bioject needless syringe) or mucosally 

1 5 with with plasmid DNAs comprising expression cassettes comprising one or more 

HIV immunogenic polypeptide (for example, gp 140 DNAs as descibed in Example 2) 
as illustrated in Table 3, A subset of the animals are subsequently boosted with a 
single dose (intramuscular, intradermally or mucosally) of the HIV protein(s) (for 
example, gp 140 DNAs as descibed in Example 2) as illustrated in Table 3. Animals 

20 may be boosted multiple times with the HIV protein. Typically, the compositions of 
the present invention used to generate inmiune responses are highly immunogenic and 
generate substantial antigen binding antibody responses after only 2 immunizations in 
rabbits. Results of these studies are used to demonstrate the usefiilness of the 
compositions and methods of the invention to generate immune responses, in 

25 particular to generate broad and potent neutralizing activity against diverse HIV 
strains. 



E. Rhesus Macaques 

Experiments may be performed in rhesus macaques as follows. Rhesus 
30 macaques are immunized at approximately 0, 4, 8, and 24 weeks parenterally or 
mucosally with plasmid DNAs comprising expression cassettes comprising one or 
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more HIV immunogenic polypeptide (for example, gp 140 DNAs as descibed in 
Example 2) as illustrated in Table 3. Enhanced DNA delivery systems such as use of 
DNA complexed to PLG microparticles or saline injection of DNA followed by 
electropoartion can be employed to increase immune response during the DNA 

5 priming phase of the immimization regimen. A subset of the animals are subsequently 
boosted with a single dose (intramuscular, intradermally or mucosally) of the HIV 
protein(s) (for example, gp 140 DNAs as descibed in Example 2) as illustrated in 
Table 3. Animals may be boosted multiple times generally at 3-6 month intervals with 
the HIV protein. Typically, the macaques have detectable HlV-specific T-cell 

10 responses as measured by CTL assays or Cytokine Flow Cytometry after two or three 
1 mg doses of the polynucleotide component. Neutralizing antibodies may also 
detected. Results of these studies are used to demonstrate the usefuhiess of the 
compositions and methods of the invention to generate immune responses, in 
particular to generate broad and potent neutraUzing activity against diverse HIV 

15 strains. 

F. Baboons 

Baboons are immunized 4 times (at approximately weeks 0, 4, 8, and 24) 
intramuscular, or intradermally, or mucosally with plasmid DNAs comprising 

20 expression cassettes comprising one or more HIV immunogenic polypeptide (for 

example, gp 140 DNAs as descibed in Example 2) as illustrated in Table 3. The DNAs 
can be delivered in saline with or without electroporation, or on PLG microparticles. A 
subset of the animals are subsequently boosted with a single dose (intramuscular, 
intradermally or mucosally) of the HIV protein(s) (for example, gp 140 DNAs as 

25 descibed in Example 2) as illustrated in Table 3. Animals may be boosted multiple 
times generally at 3-6 month intervals with the HIV protein. 

The animals are bled 2-4 weeks after each immimization and an HIV antibody 
ELISA is performed with isolated plasma. The ELISA is performed essentially as 
described below in Section G except the second antibody-conjugate is typically an 

30 anti-human IgG, g-chain specific, peroxidase conjugate (Sigma Chemical Co., St. 

Louis, MD 63 1 78) used at a dilution of 1 :500. Fifty ^ig/ml yeast extract may be added 
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to the dilutions of plasma samples and antibody conjugate to reduce non-specific 
background due to preexisting yeast antibodies in the baboons. Lymphoproliferative 
responses to are typically observed in baboons post-boosting with HIV-polypeptide 
Such proliferation results are indicative of induction of T-helper cell functions. Results 
5 of these studies are used to demonstrate the usefiilness of the compositions and 
methods of the invention to generate immime responses, in particular to generate 
broad and potent neutralizing activity against diverse HIV strains. 

G. Humoral Immune Response 
10 In any immunized animal model (including the above, as well as, for example, 

chimpanzees), the humoral immune response is checked in serum specimens from the 

immunized animals with an anti-HIV antibody ELISAs (enzyme-linked 
immunosorbent assays) at various times post-immunization. The antibody titers of the 
sera are determined by anti-HIV antibody ELISA as described above. Briefly, sera 
1 5 from immunized animals are screened for antibodies directed against the HIV 

polypeptide/protein(s) encoded by the DNA and/or polypeptide used to immunize the 
animals (e.g., oligomeric gpl40). Typically independent ELISA assays are carried out 
using polypeptides corresponding to each of the subtypes used in the immunization 
study. 

20 Wells of ELISA microliter plates are coated overnight with the selected HIV 

polypeptide/protein and washed four times; subsequently, blocking is done with PBS- 
0.2% Tween (Sigma) for 2 hours. After removal of the blocking solution, 100 fil of 
diluted mouse serum is added. Sera are tested at 1/25 dilutions and by serial 3-fold 
dilutions, thereafter. Microliter plates are washed fotu- times and incubated with a 

25 secondary, peroxidase-coupled anti-mouse IgG antibody (Pierce, Rockford, IL). 
ELISA plates are washed and 100 \i\ of 3, 3', 5, 5'-tetramethyl benzidine (TMB; 
Pierce) was added per well. The optical density of each well is measured after 15 
minutes. Titers are typically reported as the reciprocal of the dilution of serum that 
gave a half-maximum optical density (O.D.). 

30 Cellular immune responses may also be evaluated. 
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The presence of neutralizing antibodies in the sera is determined essentially as 
follows: Virus neutralization is measured in 5.25.EGFP.Luc.M7 (M7-luc) cells 
obtained from Dr. Nathaniel Landau (Salk Institute, San Diego, CA). The format of 
this assay is essentially the same as the MT-2 assay as described elsewhere 
5 (Montefiori et al. (1988) J. Clin Microbiol. 26:23 1-235) except that virus infection is 
quantified by luciferase reporter gene expression using a commercial luciferase kit 
(Promega). All serum samples are heat-inactivated for 1 hour at 56°C prior to assay. 
The virus stocks of the HIV-1 isolates are typically generated in PBMC. 

10 Example 4 

Evaluation Of Immunogenicitv Regimens For Various HIV Polypeptide Encoding 
Plasmids Used As Primes And Various HIV Polypeptides Used As Boosts 
To evaluate the combination effects of subtype C (TVl) and subtype B 
(SF162) pgl40dV2 DNAs and proteins for DNA prime/boost the following 

1 5 experiments were carried out in rabbits. DNA was gp 140mod.TV 1 .dV2 and 

gpl40mod.SF162.dV2, delivered separately in two plasmids (sources of DNA are 
described further herein below). Protein was oligomer o-gpl40.dV2.TVl and o- 
gpl40.dV2.SF162 (sources of the proteins are described further herein below). DNA 
constructs were used for immunization in three doses at schedules of 0, 4, 12 weeks. 

20 Proteins were boosted at 12, 24, and 41 weeks. Each rabbit was injected 1.0 ml DNA 
mixture at two sides IM/Quadriceps, followed by an electroporation procedure (G. 
Widera, Increased DNA vaccine delivery and immunogenicity by electroporation in 
VIVO, J. Immunology, 164, 4635-4640 (2000)). MF59 adjuvanted protein was injected 
two sites, IM/Glut for 1ml per animal. 

25 All of the genes were sequence-modified to enhance expression of the encoded 

Env glycoproteins in a Rev-independent fashion and they were subsequently cloned 
into pCMV-based plasmid vectors for DNA vaccine and protein production 
appUcations as described above. The sequences were codon optimized as described 
herein. Briefly, all the modified envelope genes were cloned into the Chiron 

30 pCMVlink plasmid vector, preferably into EcoRI/XhoI sites. 
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To obtain gpl40 polypeptides each of the gpl40 contructs (i.e., 
gpl40mod.TVl.mut7.delV2 and gpl40.mut7.modSF162.delV2) were used in the 
following method. 

Chinese hamster ovary (CHO) cells were transfected with plasmid DNA 

5 encoding the gpl40 proteins (e.g., pCMV vector backbone) using Minis TransIT-LTl 
polyamine transfection reagent (Mirus Corporation, Madison WI) according to the 
manufacturer's instructions and incubated for 96 hours. After 96 hours, media was 
changed to selective media (F12 special with 250 ng/ml G418) and cells were split 1:5 
and incubated for an additional 48 hours. Media was changed every 5-7 days until 

10 colonies started forming at which time the colonies were picked, plated into 96 well 
plates and screened by gpl20 Capture ELISA. Positive clones were expanded in 24 
well plates and screened several times for Env protein production by Capture ELISA, 
as described above. After reaching confluency in 24 well plates, positive clones were 
expanded to T25 flasks (Coming, Coming, NY). These were screened several times 

1 5 after confluency and positive clones were expanded to T75 flasks. 

Positive T75 clones were frozen in liquid nitrogen and the highest expressing 
clones amplified with 0-5 ^iM methotrexate (MTX)at several concentrations and 
plated in 100 mm culture dishes. Plates were screened for colony formation and all 
positive closed were again expanded as described above. Clones were expanded, 

20 amplified and screened at each step by gpl20 capture ELISA. Positive clones were 
frozen at each methotrexate level. Highest producing clones were grown in perfiision 
bioreactors (3L, lOOL) for expansion and adaptation to low serum suspension culture 
conditions for scale-up to larger bioreactors. 

The stably transfected CHO cell lines, which express the Env polypeptides, 

25 were used to produce gpl40 proteins. The proteins were purified, briefly, by using a 
three-step sfrategy as previously described (Srivastava, et al., Purification and 
characterization of oUgomeric envelope glycoprotein from a primary r5 subtype B 
human immunodeficiency virus. J Virol 76:2835-47 (2002)). First, concentrated cell 
supematants were passed over a Galanthus Nivalis-agarose column (GNA; Vector 

30 Laboratories, Burlingame, CA). The gpl40SF162AV2 protein bound to the column, 
and most contaminating proteins flowed through. The bound protein was eluted with 
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500 mM methyl marmose pyranoside (MMP). Next, the captured protein was passed 
over DEAE and CHAP columns. 

These methods are applicable to other HTV genes and proteins derived from 
other HIV subtypes. Further, although this analysis was carried out in rabbits similar 
5 analysis may be carried out with other type of animals, for example, as described in 
Example 3. The immunization weeks can be varied. 

The following table (Table 4) lists exemplified procedures used in a 
comparison of the immunogenicity of subtype B and C polynucleotides encoding 
envelope polypeptides (in a pCMVlink vector) in various combinations with subtype 
10 B and C envelope polypeptides, both individually and as a mixed-subtype vaccine, 
using electroporation, in rabbits. It will be apparent to one skilled in the art in view of 
the teachings of the present specification that such methods are equally applicable to 
any other polynucleotides encoding immunogenic HIV polypeptides and 
immunogenic HIV polypeptides. 

15 

Table 4 



Gro 
."P 


Animal 

# 


Imm'n 

# 


Adjuvant 


Immunogen 


Total 
Dose 


Vol/ 
Site 


Sites/ 
Animal 


Route 


1 


1-4 


1,2,3, 
4 


MF59C 


o-gpl40 dV2 SF162 


50ug 


500ul 


2 


IM/Glut 
(Needle) 


2 


5-8 


1,2,3, 
4 


Iscomatrix 


o-gpl40dV2SF162 


50ug 


500ul 


2 


IM/Glut 
(Needle) 


3 


9-12 


1,2,3 




pCMV 140 dV2 SF162 
DNA 


l.Omg 


0.50ml 


2 


IM/Quad 
(Needle) 






3,4 


MF59C 


o-gpl40dV2SF162 


50ug 


500ul 


2 


IM/Glut 
(Needle) 


4 


13-16 


1,2,3 




pCMV 140dV2SF162 
DNA 


l.Omg 


O.Sml 


2 


IM/Quad 
(Needle) 






3,4 


Iscomatrix 


o-gpl40dV2 SF162 


50ug 


500ul 


2 


IM/Glut 
(Needle) 


5 


17-20 


1, 2,3,4 


MF59C 


o-gpl40 dV2 TVl 


50ug 


500ul 


2 


IM/Glut 
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Gro 
up 


Animal 

# 


Imm'n 

# 


Adjuvant 


Immunogen 


Total 
Dose 


Vol/ 
Site 


Sites/ 
Animal 


Route 


















(Needle) 


6 


21-24 


1,2,3 




pCMV 140 dV2 TVl 
DNA 


l.Omg 


0.5ml 


2 


IM/Quad 
(Needle) 






3,4 


MF59C 


o-gpl40 dV2 SF162 


50ug 


500ul 


2 


IM/Glut 
(Needle) 


7 


25-28 


1,2,3 
3,4 


MF59C 


pCMV 140 dV2 SF162 
DNA 

pCMV140dV2TVl 
DNA 

o-gpl40dV2SF162 


2.0mg 
(l.Omg 
ea.) 

50ug 


0.5ml 
500ul 


2 
2 


IM/Quad 
(Needle) 

IM/Glut 
(Needle) 


8 


29-32 


1,2,3 
3,4 


MF59C 


pCMV 140dV2SF162 
DNA 

pCMV 140 dV2 TVl 
DNA 

o-gpl40 dV2TVl 


2.0mg 
50ug 


0.5ml 
500ul 


2 
2 


IM/Quad 
(Needle) 

IM/Glut 
(Needle) 


9 


33-36 


1,2,3 
3,4 


MF59C 


pCMV 140 dV2 SF162 
DNA 

pCMV 140 dV2 TVl 
DNA 

o-gpl40dV2 SF162 
o-gpl40 dV2 TVl 


2.0mg 
lOOug 


0.50m] 
500ul 


2 
2 


IM/Quad 
(Needle) 

IM/Glut 
(Needle) 


10 


37-40 


1,2,3 
3,4 


MF59C 


pCMV 140 dV2 SF162 
DNA 

pCMV 140 dV2 TVl 
DNA 

o-gpl40 dV2 SF162 
o-gpl40 dV2 TVl 


2.0mg 
50ug 


0.5ml 
500ul 


2 

2 


IM/Quad 
(Needle) 

IM/Glut 
(Needle) 


11 


41-44 


1,2,3 
3,4 


MF59C 


pCMV140dV2 SF162 
DNA 

pCMV 140 dV2 TVl 
DNA 

o-gpl40 dV2 SF162 


l.Omg 
50ug 


0.50ml 
500ul 


2 
2 


IM/Quad 
(Needle) 

IM/Glut 
(Needle) 



The MF59C adjuvant is a microfluidized emulsion containing 5% squalene, 
0.5% Tween 80, 0.5% Span 85, in lOmM citrate pH 6, stored in 10 ml aliquots at 4°C. 
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The Iscomatrix adjuvant is a quil saporin based adjuvant used for protein 
delivery (available from, e.g., CSL Limited, Victoria, Australia). 

The polynucleotides and polypeptides listed in Table 4 were prepared as 
described in Table 5. 
5 Table 5 



Polynucleotide Construct / 
Polypeptide 


Description 


pCMV 140 dV2 SF162 DNA 


The plasmid (pCMVlink) contained a synthetic, 
codon optimized HIV-1 gpl40 envelope gene 
from subtype B strain SF162 (see, 
gpl40.modSF162.delV2, Figure 6, see also 
PCT Intemational Publication No. 
WO/00/39302). The gpl40 gene comprised the 
gpl20 and gp41 ectodomain. The constructs 
also contained a deletion in the variable region 
V2 (dV2).The plasmid construct contained the 
human CMV enhancer/promoter and 
Kanamycin resistance gene. Plasmids were 
prepared by alkaline lysis method and Qiagen 
purification from DH5-DnE.coli bacteria. 
Plasmids were stored at -80C until use. 


pCMV 140 dV2 TVl DNA 


The plasmid (pCMVlink) contained a synthetic, 
codon optimized HIV-1 gpl40 envelope gene 
derived from HIV-l subtype C strain TVl (see, 
gpl40mod.TVl.delV2, Figure 8, see also PCT 
Intemational PubUcation No. WO/02/04493). 
The structure of the envelope gene and the 
plasmid was as described above. 


o-gpl40 dV2 SF162 protein 


The subtype B oligomer protein contained five 
amino acid mutations in the cleavage site in 
addition to the deletion of V2 region (see, 
gpl40.mut7.modSF162.delV2, Figure 7, see 
also PCT Intemational Publication No. 
WO/00/3930). Protein was expressed in CHO 
cells and purified from the CHO cells. 
Expression and purification of o-gpl40 proteins 
was described, for example, in PCT 
Intemational Publication No. WO/00/39302 and 
Srivastava, et al., J Virol 76:2835-47 (2002). 


o-gpl40 dV2 TVl protein 


The subtype C oligomer protein contained five 
amino acid mutations in the cleavage site in 
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Polynucleotide Construct / 
Polypeptide 


Description 




addition to the deletion of V2 region (see, 
gpl40mod.TVl.mut7.delV2, Figure 9, see also 
PCT International Publication No. 
WO/02/04493). Protein was expressed in CHO 
cells and purified fi-om the CHO cells. 
Expression and purification of o~gpl40 proteins 
was described, for example, in PCT 
International Publication No. WO/00/39302 and 
Srivastava, et al., J Virol 76:2835-47 (2002). 



Inununogens were prepared as described in the following table (Table 6) for 
administration to animals in the various groups. 

Table 6 

5 



Group 


Preparation 


1,5 


Immunization 1-4: Protein Immunization + MF59 

Protein doses were 50ug protein per animal. The initial protein was 
diluted to 0.100 mg/ml in citrate buffer. Stored at -80°C until use. 
Thawed at room temperature; material was clear with no particulate 
matter. Added equal volume of MF59C adjuvant to thawed protein and 
mixed well by inverting the tube. Immunized each rabbit with 0.5ml 
adjuvanted protein per side, IM/Glut for a total of 1ml per animal. Used 
material within 1 hour of the addition of adjuvant. Needles were used for 
injections. 


2 


Immunization 1-4: Protein Immunization + Iscomatrix 
The stock concentration was 1 mg/ml. Immediately before 
iirununizations, 250ul of Img/ml Iscomatrix was diluted to 2.5ml of 
O.lmg/ml with PBS (CPU U21). Added equal volume (2.5ml) of 
O.lmg/ml Iscomatrix into 2.5ml of O.lmg/ml protein and mixed well. 
Immunized each rabbit with 0.5ml adjuvanted protein per side, IM/Glut 
for a total of 1ml per animal. 


3-4,6 


Immunization 1—3: Subtype B/C plasmid DNA in Saline 

The immunogen was provided at 1.0 mg/ml total DNA in sterile 0.9% 
saline. Stored at -80°C until use. Thawed DNA at room temperature; the 
material was clear or slightly opaque, with no particulate matter. 
Immunized each rabbit with 0.5ml DNA mixture per side 
(IM/Quadriceps), total 2 sides with 1.0ml per animal. Animals were 
shaved prior to immunization, under sedation of Ix dose DP (by animal 
weight) of Ketamine-Xylazine (80mg/ml - 4mg/ml). DNA injection used 
needle. Following the DNA injection, electroporation was administrated 
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Group 


Preparation 


3,6 
4 


using a 6-needle circular array with 1cm diameter, 1cm needle length. 
Electroporation pulses were given at 20V/mm, 50ms pulse length, 1 
pulse/s. 

Immunization 3-4: Protein Immunization 

Protein doses were 50ug each SF162 protein per animal. The initial 
SF162 Protein was diluted to 0.100 mg/ml in citrate buffer. Stored at - 
80°C until use. Thawed at room temperature; material was clear with no 
particulate matter. Added equal volume of MF59C adjuvant to thawed 
protein and mixed well by inverting the tube. Immunized each rabbit 
with 0.5ml adjuvanted protein per side, IM/Glut for a total of Iml per 
animal. Used material within 1 hour of the addition of adjuvant. Needles 
were used for injections. 

Immunization 3-4: Protein immunization 

The stock concentration was 1 mg/ml. Immediately before 
immunizations, Iscomatrix was diluted to 0.1 mg/ml with PBS (CFU 
U21). Added equal volume of O.lmg/ml Iscomatrix into the 0.1 mg/ml 
protein and mixed well. Immunized each rabbit with 0.5ml adjuvanted 
protein per side, IM/Glut for a total of 1ml per animal. 


7-8, 10 


Immunization 1-3: Subtype B/C plasmid DNA in Saline 
The unmunogen was provided at 2.0mg/ml total DNA in sterile 0.9% 
saline. Stored at -80°C until use. Thawed DNA at room temperature; the 
material was clear or slightly opaque, with no particulate matter. 
Immunized each rabbit with 0.5ml DNA mixture per side 
(EM/Quadriceps), total 2 sides with 1.0ml per animal. Animals were 
shaved prior to immunization, under sedation of Ix dose IP (by animal 
weight) of Ketamine-Xylazine (80mg/ml - 4mg/ml). DNA injection used 
needle. Following the DNA injection, electroporation was administrated 
using a 6-needle circular array with 1cm diameter, 1cm needle length. 
Electroporation pulses were given at 20V/mm, 50ms pulse length, 1 
pulse/s. 

Immunization 3-4: Protein Immunization 

Protein doses were 50ug protein per animal. The initial protein was 
diluted to 0.100 mg/ml in citrate buffer. Stored at -SO^'C until use. 
Thawed at room temperature; materia! was clear with no particulate 
matter. Added equal volume of MF59C adjuvant to thawed protein and 
mixed well by inverting the tube. Immunized each rabbit with 0.5ml 
adjuvanted protein per side, M/Glut for a total of 1ml per animal. Used 
material within 1 hour of the addition of adjuvant. Needles were used for 
injections. 
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Group 


Preparation 


9 


ImmuDization 1-3: Subtype B plasmid DNA in Saline 

The immunogen was provided at l.Omg/ml total DNA in sterile 0.9% 
saline. Stored at -80°C until use. Thawed DNA at room temperature; the 
material was clear or slightly opaque, with no particulate matter. 
Immunized each rabbit with 0.5ml DNA mixture per side 
(IM/Quadriceps), total 2 sides with 1 .0ml per animal. Animals were 
shaved prior to immunization, under sedation of Ix dose IP (by animal 
weight) of Ketamine-Xylazine (80mg/ml - 4mg/ml). DNA injection used 
needle. Following the DNA injection, electroporation was administrated 
using a 6-needle circular array with 1cm diameter, 1cm needle length. 
Electroporation pulses were given at 20V/mm, 50ms pulse length, 1 
pulse/s. 

Immunization 3-4: Protein Immunization 

Protein doses were 50ug each protein per animal, total lOOug. The initial 
protein was diluted to 0.200 mg/ml in citrate buffer. Stored at -80°C until 
use. Thawed at room temperature; material was clear with no particulate 
matter. Added equal volume of MF59C adjuvant to thawed protein and 
mixed well by invertmg the tube, hnmunized each rabbit with 0.5ml 
adjuvanted protein per side, IM/Glut for a total of 1ml per animal. Used 
material within 1 hour of the addition of adjuvant. Needles were used for 
injections. 


11 


Immunization 1-3: Subtype B plasmid DNA in Saline 

The immunogen was provided at l.Omg/ml total DNA in sterile 0.9% 
saline. Stored at -SO^C until use. Thawed DNA at room temperature; the 
material was clear or slightly opaque, with no particulate matter. 
Immunized each rabbit with 0.5ml DNA mixture per side 
(IM/Quadriceps), total 2 sides with 1 .0ml per animal. Animals were 
shaved prior to immunization, under sedation of Ix dose IP (by animal 
weight) of Ketamine-Xylazine (SOmg/ml - 4mg/ml). DNA injection used 
needle. Following the DNA injection, electroporation was administrated 
using a 6-needle circular array with 1cm diameter, 1cm needle length. 
Electroporation pulses were given at 20V/mm, 50ms pulse length, 1 
pulse/s. 

Immunization 3-4: Protein Immunization 

Protein doses were 50ug protein per animal. The initial protein was 
diluted to 0.100 mg/ml in citrate buffer. Stored at -80°C until use. 
Thawed at room temperature; material was clear with no particulate 
matter. Added equal volume of MF59C adjuvant to thawed protein and 
mixed well by inverting the tube. Immunized each rabbit with 0.5ml 
adjuvanted protein per side, IM/Glut for a total of 1ml per animal. Used 
material within 1 hour of the addition of adjuvant. Needles were used for 
injections. 
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Group 


Preparation 







The immunization (Table 7) schedules were as follows: 
Table 7 



Imm'n: 
Weeks: 
Group 


1 
0 


2 
4 


3 
12 


4 

24 


1 


Gpl40dV2 
SF162 + MF59C 


Gpl40dV2 SF162 
+ MF59C 


Gpl40dV2SF162 
+ MF59C 


Gpl40 dV2 
SF162 + MF59C 


2 
3 


Gpl40 dV2 
SF162 + 
Iscomatrix 
pCMV 140 dV2 
SF162DNA 


Gpl40 dV2 SF162 
+ Iscomatrix 

pCMV 140 dV2 
SF162 DNA 


Gpl40 dV2 SF162 
+ Iscomatrix 

pCMV 140 dV2 
SF162 DNA 
Gpl40dV2 SF162 
+ MF59C 


Gpl40dV2 
SF162 + 
Iscomatrix 
Gpl40 dV2 
SF162 + MF59C 


4 


pCMV 140 dV2 
SF162 DNA 


pCMV 140 dV2 
SF162 DNA 


pCMV 140 dV2 
SF162 DNA 
Gpl40dV2SF162 
+ Iscomatrix 


Gpl40 dV2 
SF162 + 
Iscomatrix 


5 


Gpl40dV2TVl 
+ MF59C 


Gpl40dV2TVl + 
MF59C 


Gpl40 dV2 TVl + 
MF59C 


Gpl40 dV2 TVl 
+ MF59C 


6 


PCMV 140 dV2 
TVl DNA 


pCMV140dV2 
TVl DNA 


pCMV140 dV2 
TVl DNA 
Gpl40dV2TVl + 
MF59C 


Gpl40dV2TVl 
+ MF59C 


7 


pCMV 140 dV2 
SF162 DNA + 
PCMV 140 dV2 
TVl DNA 


pCMV 140 dV2 
SF162DNA + 
PCMV 140 dV2 
TVl DNA 


pCMV 140 dV2 
SF162 DNA + 
PCMV 140 dV2 
TVl DNA 
Gpl40dV2SF162 
+ MF59C 


Gpl40 dV2 
SF162 + MF59C 


8 


pCMV 140 dV2 
SF162 DNA + 
PCMV 140 dV2 
TVl DNA 


pCMV 140 dV2 
SF162DNA + 
PCMV 140 dV2 
TVl DNA 


pCMV 140 dV2 
SF162DNA + 
PCMV 140 dV2 
TVl DNA 
Gpl40dV2TVl + 
MF59C 


Gpl40 dV2 TVl 
+ MF59C 
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Imm'n- 


1 


2 


3 


4 


Weeks' 


0 


4 


12 


24 


Group 










9 


pCMV 140 dV2 


pCMV 140 dV2 


pCMV 140 dV2 


Gpl40dV2 




SF162DNA + 


SF162 DNA + 


SF162 DNA + 


SF162 + MF59C 




PCMV 140 dV2 


PCMV 140 dV2 


PCMV 140 dV2 


Gpl40 dV2TVl 




TVl DNA 


TVl DNA 


TVl DNA 


+ MF59C 








Gpl40 dV2 SF162 


(lOOug Prot.) 








+ MF59C 










Gpl40 dV2 TVl + 










MF59C (lOOug 










Prot.) 




10 


pCMV 140 dV2 


pCMV 140 dV2 


pCMV 140 dV2 


Gpl40dV2 




SF162DNA + 


SF162DNA + 


SF162DNA + 


SF162 + MF59C 




PCMV 140 dV2 


PCMV 140 dV2 


PCMV 140 dV2 


Gpl40dV2TVl 




TVl DNA 


TVl DNA 


TVl DNA 


+ MF59C 








Gpl40dV2SF162 


(SOug Prot.) 








+ MF59C 










Gpl40 dV2 TVl + 










MF59C 










(SOug Prot.) 




11 


pCMV 140 dV2 


pCMV 140 dV2 


pCMV 140 dV2 


Gpl40 dV2 




SF162DNA + 


SF162DNA + 


SF162DNA + 


SF162 + MFS9C 




PCMV 140 dV2 


PCMV 140 dV2 


PCMV 140 dV2 






TVl DNA 


TVl DNA 


TVl DNA (l.Omg) 










Gpl40dV2 SF162 










+ MF59C 






Note: all DNA 


Note: all proteins 








was l.Omg each 


were SOug each 








except group 11 


except group 10 








used O.Sing DNA 
each. 


used 25ug ea. 







Table 7 (cont.) 



Imm'n: 


5 


6 


Weeks: 


41 


56 


Group 








Gpl40dV2 SF162 + 


Gpl40dV2 SF162 + 




MF59C 


MF59C 


2 


Gpl40 dV2 SF162 + 


Gpl40dV2 SF162 + 




Iscomatrix 


Iscomatrix 


3 


Gpl40 dV2 SF162 + 


Gpl40dV2 SF162 + 




MF59C 


MF59C 


4 


Gpl40 dV2 SF162 + 


Gpl40 dV2 SF162 + 




Iscomatrix 


Iscomatrix 


5 


Gpl40 dV2TVl +MF59C 


Gpl40 dV2 TVl +MF59C 


6 


Gpl40 dV2 TVl +MF59C 


Gpl40dV2 TVl +MF59C 


7 


Gpl40dV2 SF162 + 


Gpl40dV2SF162 + 




MF59C 


MF59C 
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Imm'n: 


5 


6 


Weeks: 


41 


56 








8 


Gpl40 dV2 TVl + MF59C 


Gpl40 dV2 TVl + MF59C 


9 


Gpl40 dV2 SF162 + 


Gpl40dV2 SF162 + 




MF59C 


MF59C 




Gpl40dV2TVl+MF59C 


Gpl40dV2TVl +MF59C 




(lOOugProt.) 


(lOOugProt.) 


10 


Gpl40dV2 SF162 + 


Gpl40dV2 SF162 + 




MF59C 


MF59C 




Gpl40 dV2 TVl + MF59C 


Gpl40 dV2 TVl + MF59C 




(50ug Prot.) 


(50ug Prot.) 


11 


GpHO dV2 SF162 + 


GpHO dV2 SF162 + 




MF59C 


MF59C 




Note: all DNA was l.Omg 


Note: all proteins were 




each except group 11 


SOug each except group 




used CSmg DNA each. 


10 used 2Sug each. 



The bleeding (Table 8) schedules for all groups (A-F) were as follows: 
Table 8 



Bleed: 


0 


1 


2 


3 


4 


5 


6 


7 


Week: 


0 


2 


6 


8 


12 


14 


16 


24 


Sample: 


Clotted 


Clotted 


Clotted 


Clotted 


Clotted 


Clotted 


Clotted 


Clotted 


Bid. 


Bid. 


Bid. 


Bid. 


Bid. 


Bid. 


Bid. 


Bid. 




for Serum 


for Serum 


for Seram 


for Serum 


for Semm 


for Semm 


for Seram 


for Seram 


Bleed: 


8 


9 


10 


11 


12 


13 


14 


15 


Week: 


26 


28 


41 


43 


45 


56 


58 


60 


Sample: 


Clotted 


Clotted 


Clotted 


Clotted 


Clotted 


Clotted 


Clotted 


Clotted 


Bid. 


Bid. 


Bid. 


Bid. 


Bid. 


Bid. 


Bid. 


Bid. 




for Serum 


for Semm 


for Semm 


for Semm 


for Semm 


for Semm 


for Seram 


for Semm 



To evaluate the combination effects of subtype C (TVl) and subtype B 
(SF162) gpl40dV2 DNAs and proteins for DNA prime/boost on the generation of 
neutralizing antibody activity against HFV strain SF162 (subtype B) the following 

10 comparisons were carried out. 

Neutralizing antibody responses against PBMC-grown SF162 and TVl HIV-1 
strains were monitored in the sera collected from the immunized rabbits using the 
following assay conducted essentially as follows. Virus neutralization was measured 
in 5.25.EGFP.LUC.M7 (M7-luc) cells obtained from Dr. Nathaniel Landau (Salk 

15 histitute, San Diego, CA). The format of this assay was essentially the same as the 
MT-2 assay that has been described elsewhere (Montefiori, et al., /. Clin Microbiol. 
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26:23 1-235, 1988) except that virus infection was quantified by luciferase reporter 
gene expression using a commercial luciferase kit (Promega). All serum samples were 
heat-inactivated for 1 hour at 56°C prior to assay. The virus stocks of the HIV-1 
islolates were generated in PBMC. Neutralizing antibody titers are reported as 
5 reciprocal serum dilution at which 50% luciferase activity was measured in test wells 
as compared to virus control wells. Values shown in Figures 4 and 5 are the geometric 
mean titers plus standard errors of the neutralization titers for each group of animals. 

The results of the assays for the presence of neutralizing antibodies are 
presented in Figure 4 and Figure 5. In the figures, the following Immunization Groups 
1 0 correspond to the Groups in Table 4: B DNA + B prot; C DNA + B prot (Group 6); 
B+C DNA + B prot (Group 7); B+C DNA + C prot (Group 8); B+C DNA & prot 
(Group 9); B+C DNA. & prot (1/2) (Group 10); and, B+C DNA (1/2) + C prot (Group 
11). 

In Figure 4, the first vertical bar of each group of three bars is neutralizing 

15 activity against HIV-1 SF-162 in prebleed rabbit serum (Figure 4, Prebleed), the 
second vertical bar is serum from a bleed two weeks after the third immunization 
(Figure 4, 2 wk post 3^'^), and the third vertical bar is serum from a bleed two weeks 
after the fourth immunization (Figure 4, 2 wk post 4'*^). 

Figure 4 summarizes data showing the neutralization titers against HIV-1 

20 SF162 between the 7 groups described above. These results demonstrated that all 
groups showed strong neutralizing activity against the HIV-1 SF162 isolate. Further, 
neutralizing activity significantly increased at post 4* immunization compared to post 
3"* immunizations. Priming and boosting with B gene and B protein (B DNA + B 
prot) showed a high titer, as did the C gene and B protein (C DNA + B prot). For the 

25 mixed (B+C) DNA prime and single protein boost, B protein gave a high boost to the 
mixed gene prime (B+C DNA + B prot) and a boost to the C protein (B+C DNA + C 
prot). For the mixed DNA prime and protein boost, half dose (50ug) of protein (B+C 
DNA & prot (1/2)) induced high neutraUzing activity as did the fiiU dose of lOOug 
protein (B+C DNA & prot). The mixed DNA prime and single protein boost with 

30 subtype C protein, the half- dose (Img) DNA (B+C DNA + C prot) also gave 

neutralizing activity, as did the fiill-dose of 2mg DNA (B+C DNA (1/2) + C prot). 
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In Figure 5, the prebleed values for neutralizing activity against HIV-1 TVl in 
prebleed rabbit serum were less than one log for each group of bars (Figure 5, 
Prebleed), the grey vertical bars for each group are serum from bleeds two weeks after 
the fourth immunization (Figure 5, 2 wk post 4*). 
5 Figure 5 summarizes data showing the neutraUzation titers against HIV-1 TVl 

(South African Subtype C) between the 7 groups described above. These results 
demonstrated that all groups showed neutralizing activity against HIVl subtype C 
TVl isolate (as expected, because no Subtype C DNA or protein was used, the B DNA 
+ B protein showed the lowest neutralizing activity). For the mismatched a single 

10 DNA prime and a single protein boost (C DNA + B prot), priming with C gene and 
boosting with B protein showed a high titer, as did the B gene and B protein (B DNA 
+ B prot). For the mixed (B+C) DNA prime and single protein boost, use of either B 
(B+C DNA + B prot) and C (B+C DNA + C prot) proteins had a similar boosting 
effect. For the mixed DNA prime and protein boost, full dose of 1 OOug protein (B+C 

1 5 DNA & prot) induced high neutralizing activity, as did the half dose of 50ug protein 
(B+C DNA & prot (1/2)). The half-dose (Img) DNA (B+C DNA (1/2) + C prot) also 
gave neutralizing activity, as did the full-dose of 2mg DNA (B+C DNA + C prot). 

Comparison of the data presented in Figure 4 and Figure 5 supported the 
combination methods of the present invention for generating an immune response in a 

20 subject. Such a comparison showed that the combination of DNA derived from 

different subtypes primed broad responses to multiple strains from different subtypes. 
This may indicate the targeting common conserved epitopes. Further, use of a single 
subtype protein was sufficient to boost broad neutralizing responses when immunity 
was primed with multiple strains from different subtypes of DNA. The DNA priming 

25 maintained the native envelope structure. This can induce T cell responses in addition 
to the B cell response. Finally, these results demonsfrated that use of lower doses of 
proteins mixture can also provide sfrong immime responses. 

These studies demonstrated the usefulness of the compositions and methods of 
the invention to generate immune responses, in particular to generate broad and potent 

30 neutralizing activity against diverse HIV sfrains. 
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Although preferred embodiments of the subject invention have been described 
in some detail, it is understood that obvious variations can be made without departing 
from the spirit and the scope of the invention. The following embodiments are offered 
for illustrative purposes only, and are not intended to limit the scope of the present 
5 invention in any way. 
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Exemplary Embodiments of the Present Invention: 

1. A composition for generating an immune response in a mammal, said 
composition comprising, 

5 a polynucleotide component consisting essentially of one polynucleotide 

encoding an HfV immunogenic polypeptide, and 

a polypeptide component comprising one or more HIV immunogenic 
polypeptides analogous to the polypeptide encoded by said polynucleotide component, 
with the proviso that at least one HIV immunogenic polypeptide of the polypeptide 
10 component is derived from a different HIV subtype than the subtype from which the 
immunogenic polypeptide encoded by the polynucleotide component is derived. 

2. A composition for generating an immune response in a mammal, said 
composition comprising, 

15 a polynucleotide component comprising two or more polynucleotide sequences 

comprising coding sequences for two or more analogous HIV immunogenic 
polypeptides, wherein the coding sequences for at least two of the HIV immunogenic 
polypeptides are derived from different HIV subtypes, and 

a polypeptide component comprising one or more HIV immunogenic 

20 polypeptides analogous to the polypeptide encoded by said polynucleotide component, 
with the proviso that, if the polypeptide component comprises the same number or 
greater than the number of analogous HIV immunogenic polypeptides encoded by the 
polynucleotide component, then at least one of the HfV immunogenic polypeptides of 
the polypeptide composition is derived from a different HIV subtype than the HIV 

25 immunogenic polypeptides provided by the polynucleotide component. 

3. The composition of embodiments 1 or 2, wherein said polynucleotide 
component comprises at least one polynucleotide that is a native polynucleotide. 

30 4. The composition of embodiments 1 or 2, wherein said polynucleotide 

component comprises at least one polynucleotide that is a synthetic polynucleotide. 
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5. The composition of embodiment 4, wherein said synthetic polynucleotide 
comprises codons optimized for expression in mammalian cells. 

6. The composition of embodiment 5, wherein said synthetic polynucleotide 
comprises codons optimized for expression in human cells. 

7. The composition of embodiments 1 or 2, wherein the HIV immunogenic 
polypeptides are HIV envelope polypeptides. 

8. The composition of embodiment 7, wherein at least one of said HIV 
polypeptides comprises one or more mutations. 

9. The composition of embodiment 8, wherein at least one of said envelope 
polypeptides comprises a mutation in the cleavage site or a mutation in the 
glycosylation site. 

10. The composition of embodiment 8, wherein at least one of said envelope 
polypeptides comprises a deletion or modification of the VI region. 

1 1 . The composition of embodiment 8, wherein at least one of said envelope 
polypeptides comprises a deletion or modification of the V2 region. 

12. The composition of embodiment 8, wherein at least one of said envelope 
polypeptides comprises a deletion or modification of the V3 region. 

13. The composition of embodiment 8, wherein at least one of said envelope 
polypeptides comprises a deletion or modification of regions selected fi:om the group 
consisting of the VI region, the V2 region, the V3 region, and combinations thereof. 
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14. The composition of embodiment 8, wherein at least one of said envelope 
polypeptides comprises envelope polypeptide modified to expose an envelope binding 
region that binds to a CCR5 chemokine co-receptor. 

5 15. The composition of embodiments 1 or 2, wherein at least one 

polynucleotide encoding an HTV immimogenic polypeptide encodes an immunogenic 
HIV polypeptide selected from the group consisting of: Gag, Env, Pol, Prot, Int, RT, 
vif, vpr, vpu, tat, rev, and nef. 

10 16. The composition of embodiments 1 or 2, wherein the HIV subtypes are 

selected fi-om the group consisting of: Subtype A, Subtype B, Subtype C, Subtype D, 
Subtype E, Subtype F, Subtype G, Subtype H, Subtype I, Subtype J, Subtype K , 
Subtype N, and Subtype O. 

15 17. The composition of embodiments 1 or 2, wherein at least one of said 

inununogenic HIV polypeptides comprises one or more mutations. 

18. The composition of embodiment 2, wherein said polynucleotide 
component further comprises a sequence encoding an additional antigenic 

20 polypeptide. 

19. The composition of embodiment 18, wherein said polypeptide component 
further comprises a polypeptide having an additional antigenic peptide. 

25 20. The composition of embodiments 1 or 2, wherein said polynucleotide 

component further comprises a sequence encoding an additional antigenic 
polypeptide, with the proviso that the additional antigenic polypeptide is not an 
immunogenic polypeptide derived from an HIV-1 sfrain. 

30 21. The composition of embodiment 20, wherein said polypeptide component 

ftirther comprises a polypeptide having an additional antigenic peptide. 
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22. The composition of embodiments 1 or 2, wlierein said polynucleotide 
component further comprises sequences encoding one or more control elements 
compatible with expression in a selected host cell, wherein said control elements are 

5 operable linked to polynucleotides encoding HIV immunogenic polypeptides. 

23. The composition of embodiment 22, wherein said control elements are 
selected from the group consisting of a transcription promoter, a transcription 
enhancer element, a transcription termination signal, polyadenylation sequences, 

1 0 sequences for optimization of initiation of translation, an internal ribosome entry site 
and translation termination sequences. 

24. The composition of embodiment 23, wherein said transcription promoter 
is selected from the group consisting of CMV, CMV+intron A, SV40, RSV, HIV-Ltr, 

1 5 MMLV-ltr, an alphavirus subgenomic promoter and metallothionein. 

25. A method of generating an immune response in a subject, comprising, 
providing the composition for generating an immune response in a mammal of 

embodiment 1 or embodiment 2; 
20 administering one or more gene delivery vectors comprising the 

polynucleotides of said polynucleotide component of the composition into said subject 
under conditions that are compatible with expression of said polynucleotides in said 
subject for the production of encoded HIV immunogenic polypeptides; and 
administering the polypeptide component to said subject. 

25 

26. The method of embodiment 25, wherein said one or more gene delivery 
vectors and said polypeptide component are introduced concurrently. 

27. The method of embodiment 26, wherein said one or more gene delivery 
30 vectors and said polypeptide component are introduced sequentially. 
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28. The method of embodiment 25, wherein said polypeptide component 
further comprises an adjuvant. 

29. The method of embodiment 25, wherein said polynucleotide component 
5 further comprises a carrier. 

30. The method of embodiment 25, wherein said one or more gene delivery 
vectors are nonviral vectors. 

10 31. The method of embodiment 25 , wherein said one or more gene dehvery 

vectors are delivered using a particulate carrier. 

32. The method of embodiment 3 1 , wherein said one or more gene 

delivery vectors are coated on a gold or tungsten particle and said 
1 5 coated particle is delivered to said subject using a gene gun. 

3 3 . The method of embodiment 3 1 , wherein said one or more gene 

delivery vectors are delivered using a PLG particle. 

20 34. The method of embodiment 29, wherein said one or more gene delivery 

vectors are encapsulated in a liposome preparation. 

35. The method of embodiment 25, wherein said one or more gene delivery 
vectors are viral vectors. 

25 

36. The method of embodiment 35, wherein said viral vectors are retroviral 

vectors. 

37. The method of embodiment 35, wherein said viral vector are lentiviral 

30 vectors. 
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38. The method of embodiment 35, wherein said viral vectors are aiphaviral 

vectors. 

39. The method of embodiment 25, wherein said subject is a mammal. 

5 

40. The method of embodiment 39, wherein said mammal is a human. 

41. The method of embodiment 25, wherein said immune response is a 
humoral immune response. 

10 

42. The method of embodiment 25, wherein said immune response is a 
cellular immune response. 

43. The method of embodiment 25, wherein said one or more gene delivery 
1 5 vectors are administered intramuscularly, intramucosally, intranasally, 

subcutaneously, intradermally, transdermally, intravaginally, intrarectally, orally or 
intravenously. 

44. The method of embodiment 25, wherein said immune response results in 
20 generating neutralizing antibodies in the subject against multiple HIV-subtypes. 
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COMBINATION APPROACHES FOR GENERATING IMMUNE 
RESPONSES 

Abstract of the Disclosure 

The present invention relates to methods, polynucleotides, and polypeptides 
encoding immunogenic HIV polypeptides. Uses of the polynucleotides and 
polypeptides in combination approaches for generating immune responses are 
described. The combination approaches described herein have been shown to induce 
broad and potent neutralizing activity against diverse HIV strains from multiple 
subtypes. Formulations of compositions for generating immune responses and 
methods of use for such compositions are also disclosed. 



FIGURE lA 



1 TGGAAGGGTT AATTTACTCC AAGAAAAGGC 
61 CACAAGGCTT CTTCCCTGAT TGGCAAAACT 
121 TGACCTTTGG ATGGTGCTAC AAGCTAGTGC 
181 ACGGAGGAGA AGACAACTGT TTGCTACACC 
241 ATAGAGAAGT ATTAAAGTGG AAGTTTGACA 
301 AGCTACATCC GGAGTATTAC AAAGACTGCT 
361 TCCACTGGGG CGTTCCGGGA GGTGTGGTCT 
421 ATGCTGCATA TAAGCAGCTG CTTTTCGCCT 
481 GAGCCTGGGA GCCCTCTGGC TATCTAGGGA 
541 CTTGAGTGCT TTAAGTAGTG TGTGCCCATC 
601 TCAGACCCTT TGTGGTAGTG TGGAAAATCT 
661 AGTGAAAGTG AGACCAGAGG AGATCTCTCG 
721 GGCAAGAGGC GAGAGGGGCG GCTGGTGAGT 
781 GAAGGAGAGA GATGGGTGCG AGAGCGTCAA 
841 AAAGAATTAG GTTAAGGCCA GGGGGAAAGA 
901 CAAGCAGGGA GCTGGAAAGA TTTGCACTTA 
961 GTAAACAAAT AATAAAACAG CTACAACCAG 
1021 CATTATTCAA CACAGTAGCA ACTCTCTATT 
1081 CCAAGGAAGC CTTAGACAAG ATAGAGGAAG 
1141 AGGCAAAAGC AGCTGACGAA AAGGTCAGTC 
1201 GGCAAATGGT ACACCAAGCT ATATCACCTA 
12 61 AGGAAAAGGC TTTCAATCCA GAGGAAATAC 
1321 CCCCACAAGA TTTAAACACA ATGTTAAATA 
1381 TGTTAAAAGA TACCATCAAT GAGGAGGCTG 
1441 CAGGGCCTGT TGCACCAGGC CAGATGAGAG 
1501 CTAGTACCCT TCAGGAACAA ATAGCATGGA 
1561 ACATCTATAA AAGATGGATA ATTCTGGGGT 
1621 TTAGCATTTT GGACATAAAA CAAGGGCCAA 
1681 TCTTTAAAAC CTTAAGAGCT GAACAAGCTA 
1741 CCTTGTTGGT CCAAAATGCG AACCCAGATT 
1801 GGGCCTCATT AGAAGAAATG ATGACAGCAT 
1861 CAAGAGTGTT GGCTGAGGCA ATGAGCCAAG 
1921 ATTTTAAAGG CTCTAACAGA ATTATTAAAT 
1981 CCAGAAATTG CAGGGCCCCT AGGAAAAAGG 
2041 AAATGAAAGA CTGTACTGAG AGGCAGGCTA 
2101 AGGGGAGGCC AGGGAATTTC CTCCAGAACA 
2161 CAACAGCCCC ACCAGCAGAG AGCTTCAGGT 
2221 AGAAAGAGAG GGAACCTTTA ACTTCCCTCA 
2281 AATAAAAGTA GAGGGCCAGA TAAAGGAGGC 
2341 ATTAGAAGAA ATAGATTTGC CAGGGAAATG 
2401 TTTTATCAAA GTAAGACAGT ATGATCAAAT 
2461 AGGTACAGTA TTAGTAGGGC CTACACCAGT 
2521 GCTTGGATGC ACACTAAATT TTCCAATTAG 
2581 ACCAGGAATG GATGGCCCAA AGGTCAAACA 
2641 ATTAACAGCA ATTTGTGAGG AAATGGAGAA 
2701 TAATCCATAT AACACTCCAG TATTTGCCAT 
2761 ATTAGTAGAT TTCAGGGAAC TCAATAAAAG 
2821 AATACCACAC CCAGCAGGAT TAAAAAAGAA 
2881 TGCATATTTT TCAGTTCCTT TAGATGAAAG 
2941 TAGTATAAAC AATGAAACAC CAGGGATTAG 
3001 GAAAGGATCA CCAGCAATAT TCCAGAGTAG 
3061 AAAAAATCCA GACATAGTTA TCTATCAATA 
3121 AGAAATAGGG CAACATAGAG CAAAAATAGA 



AAGAAATCCT TGATTTGTGG GTCTATCACA 
ACACACCGGG GCCAGGGGTC AGATATCCAC 
CAGTTGACCC AGGGGAGGTG GAAGAGGCCA 
CTATGAGCCA ACATGGAGCA GAGGATGAAG 
GCCTCCTAGC ACGCAGACAC ATGGCCCGCG 
GACACAGAAG GGACTTTCCG CCTGGGACTT 
GGGCGGGACT TGGGAGTGGT CAACCCTCAG 
GTACTGGGTC TCTCTCGGTA GACCAGATCT 
ACCCACTGCT TAAGCCTCAA TAAAGCTTGC 
TGTTGTGTGA CTCTGGTAAC TAGAGATCCC 
CTAGCAGTGG CGCCCGAACA GGGACCAGAA 
ACGCAGGACT CGGCTTGCTG AAGTGCACAC 
ACGCCAATTT TACTTGACTA GCGGAGGCTA 
TATTAAGCGG CGGAAAATTA GATAAATGGG 
AACATTATAT GTTAAAACAT CTAGTATGGG 
ACCCTGGCCT GTTAGAAACA TCAGAAGGCT 
CTCTTCAGAC AGGAACAGAG GAACTTAGAT 
GTGTACATAA AGGGATAGAG GTACGAGACA 
AACAAAACAA ATGTCAGCAA AAAGCACAAC 
AAAATTATCC TATAGTACAG AATGCCCAAG 
GAACATTGAA TGCATGGATA AAAGTAATAG 
CCATGTTTAC AGCATTATCA GAAGGAGCCA 
CAGTGGGGGG ACATCAAGCA GCCATGCAAA 
CAGAATGGGA TAGGACACAT CCAGTACATG 
AACCAAGGGG AAGTGACATA GCAGGAACTA 
TGACAAGTAA TCCACCTATT CCAGTAGAAG 
TAAATAAAAT AGTAAGAATG TATAGCCCTG 
AAGAACCCTT TAGAGACTAT GTAGACCGGT 
CACAAGATGT AAAGAATTGG ATGACAGACA 
GTAAGACCAT TTTAAGAGCA TTAGGACCAG 
GTCAGGGAGT GGGAGGACCT AGCCATAAAG 
CAAACAGTAA CATACTAGTG CAGAGAAGCA 
GTTTCAACTG TGGCAAAGTA GGGCACATAG 
GCTGTTGGAA ATGTGGACAG GAAGGACACC 
ATTTTTTAGG GAAAATTTGG CCTTCCCACA 
GACCAGAGCC AACAGCCCCA CCAGCAGAAC 
TCGAGGAGAC AACCCCCGTG CCGAGGAAGG 
AATCACTCTT TGGCAGCGAC CCCTTGTCTC 
TCTCTTAGAC ACAGGAGCAG ATGATACAGT 
GAAACCAAAA ATGATAGGGG GAATTGGAGG 
ACTTATAGAA ATTTGTGGAA AAAAGGCTAT 
CAACATAATT GGAAGAAATC TGTTAACTCA 
TCCTATTGAA ACTGTACCAG TAAAATTAAA 
ATGGCCATTG ACAGAAGAAA AAATAAAAGC 
GGAAGGAAAA ATTACAAAAA TTGGGCCTGA 
AAAAAAGAAG GACAGTACTA AGTGGAGAAA 
AACTCAAGAC TTTTGGGAAG TTCAATTAGG 
AAAATCAGTG ACAGTGCTAG ATGTGGGGGA 
CTTCAGGAAA TATACTGCAT TCACCATACC 
ATATCAATAT AATGTGCTGC CACAGGGATG 
CATGACAAAA ATCTTAGAGC CCTTCAGAGC 
TATGGATGAC TTGTATGTAG GATCTGACTT 
AGAGTTAAGG GAACATTTAT TGAAATGGGG 



FIGURE IB 

3181 ATTTACAACA CCAGACAAGA AACATCAAAA 
3241 ACTCCATCCT GACAAATGGA CAGTACAACC 
3301 TGTCAATGAT ATACAGAAGT TAGTGGGAAA 
3361 GATTAAAGTA AGGCAACTCT GTAAACTCCT- 
3421 ACCACTAACT GAAGAAGCAG AATTAGAATT 
3481 AGTACATGGA GTATATTATG ATCCATCAAA 
3541 GCATGAACAA TGGACATATC AAATTTATCA 
3601 GTATGCAAAA ATGAGGACTA CCCACACTAA 
3661 AAAAATAGCC ATGGAAAGCA TAGTAATATG 
3721 CCAAAAAGAA ACATGGGAGA CATGGTGGAC 
3781 GTGGGAGTTT GTTAATACCC CTCCCCTAGT 
3841 CATAGCAGGA GTAGAAACTT TCTATGTAGA 
3901 AAAAGCAGGG TATGTTACTG ACAGAGGAAG 
3961 AAATCAGAAG ACTGAGTTAC AAGCAATTCA 
4021 AAACATAGTA ACAGACTCAC AGTATGCATT 
4081 TGACTCAGAG ATATTTAACC AAATAATAGA 
4141 GTCATGGGTA CCAGCACATA AAGGAATTGG 
4201 TAAGGGAATT AGGAAAGTGT TGTTTCTAGA 
4261 AAGGTACCAC AGCAATTGGA GAGCAATGGC 
4321 AAAAGAAATA GTAGCTAGCT GTGATAAATG 
4 381 AGTCGACTGT AGTCCAGGGA TATGGCAATT 
44 41 CCTGGTAGCA GTCCATGTAG CTAGTGGCTA 
4 501 AGGACAAGAA ACAGCATATT TTATATTAAA 
4561 ACATACAGAC AATGGCAGTA ATTTTACCAG 
4 621 AGGTATCCAA CAGGAATTTG GAATTCCCTA 
4 681 CATGAATAAA GAATTAAAGA AAATAATAGG 
4 741 GACAGCAGTA CAAATGGCAG TATTCATTCA 
4801 GTACAGTGCA GGGGAAAGAA TAATAGACAT 
4 861 ACAAAAACAA ATTATAAGAA TTCAAAATTT 
4 921 TATTTGGAAA GGACCAGCCG AACTACTCTG 
4 981 TAAAGGTGAC ATAAAGGTAG TACCAAGGAG 
5041 ACAGATGGCA GGTGCTGATT GTGTGGCAGG 
5101 GTTTAGTAAA GCACCATATG TATATATCAA 
5161 ATTTTGAAAG CAGACATCCA AAAGTAAGTT 
5221 GATTAGTAAT AAAAACATAT TGGGGTTTGC 
5281 ATGGAGTCTC CATAGAATGG AGACTGAGAG 
5341 CAGACCAGCT AATTCACATG CATTATTTTG 
5401 CCATATTAGG ACACATAGTT TTTCCTAGGT 
54 61 GATCTCTGCA ATACTTGGCA CTGACAGCAT 
5521 TGCCTAGTGT TAGAAAATTA GTAGAGGATA 
5581 GCAGAGGGAA CCATACAATG AATGGACACT 
5641 TGTCAGACAC TTTCCTAGAC CATGGCTCCA 
5701 TGGGGATAQT TGGACGGGAG TTGAAGCTAT 
5761 TCATTTCAGA ATTGGATGCC AACATAGCAG 
5821 AAATGGAGCC AGTAGATCCT AAACTAAAGC 
5881 CAGCTTGTAA TAATTGCTTT TGCAAACACT 
5941 CAAAAGGTTT AGGCATTTCC TATGGCAGGA 
6001 CAAGTGGTGA AGATCATCAA AATCCTCTAT 
6061 TGGTAAGTTT AAGTTTATTT AAAGGAGTAG 
6121 TAGCACTAAT CATAGCAATA ATAGTGTGGA 
6181 TAAGACAAAA GAAAATAGAC TGGTTAATTA 
6241 GCAATGAGAG TGATGGGGAC ACAGAAGAAT 
6301 GGCTTCTGGA TGCTAATGAT TTGTAACACG 



AGAACCCCCA TTTCTTTGGA TGGGGTATGA 
TATACTGCTG CCAGAAAAGG ATAGTTGGAC 
ATTAAACTGG GCAAGTCAGA TTTACCCAGG 
CAGGGGGGCC AAAGCACTAA CAGACATAGT 
GGCAGAGAAC AGGGAAATTT TAAGAGAACC 
AGACTTGATA GCTGT^TAC AGAAACAGGG 
AGAACCATTT AAAAATCTGA AAACAGGGAA 
TGATGTAAAA CAGTTAACAG AGGCAGTGCA 
GGGAAAGACT CCTAAATTTA GACTACCCAT 
AGACTATTGG CAAGCCACCT GGATCCCTGA 
AAAATTATGG TACCAACTAG AAAAAGATCC 
TGGAGCAACT AATAGGGAAG CTAAAATAGG 
GCAGAAAATT GTTACTCTAA CTAACACAAC 
GCTAGCTCTG CAGGATTCAG GATCAGAAGT 
AGGAATCATT CAAGCACAAC CAGAT7VAGAG 
ACAGTTAATA AACAAGGAAA GAATCTACCT 
GGGAAATGAA CAAGTAGATA AATTAGTAAG 
TGGAATAGAT AAAGCTCAAG AAGAGCATGA 
TAATGAGTTT AATCTGCCAC CCATAGTAGC 
TCAGCTAAAA GGGGAAGCCA TACATGGACA 
AGATTGTACC CATTTAGAGG GAAAAATCAT 
CATGGAAGCA GAGGTTATCC CAGCAGAAAC 
ATTAGCAGGA AGATGGCCAG TCAAAGTAAT 
TACTGCAGTT AAGGCAGCCT GTTGGTGGGC 
CAATCCCCAA AGTCAGGGAG TGGTAGAATC 
ACAAGTAAGA GATCAAGCTG AGCACCTTAA 
CAATTTTAAA AGAAAAGGGG GAATTGGGGG 
AATAGCAACA GACATACAAA CTAAAGAATT 
TCGGGTTTAT TACAGAGACA GCAGAGACCC 
GAAAGGTGAA GGGGTAGTAG TAATAGAAGA 
GAAAGCAAAA ATCATTAGAG ATTATGGAAA 
TGGACAGGAT GAAGATTAGA GCATGGAATA 
GGAGAGCTAG TGGATGGGTC TACAGACATC 
CAGAAGTACA TATCCCATTA GGGGATGCTA 
AGACAGGAGA AAGAGATTGG CATTTGGGTC 
AATACAGCAC ACAAGTAGAC CCTGACCTGG 
ATTGTTTTAC AGAATCTGCC ATAAGACAAG 
GTGACTATCA AGCAGGACAT AAGAAGGTAG 
TGATAAAACC AAAAAAGAGA AAGCCACCTC 
GATGGAACGA CCCCCAGAAG ACCAGGGGCC 
AGAGATTCTA GAAGAACTCA AGCAGGAAGC 
TAGCTTAGGA CAATATATCT ATGAAACCTA 
AATAAGAGTA CTGCAACAAC TACTGTTCAT 
AATAGGCATC TTGCGACAGA GAAGAGCAAG 
CCTGGAACCA TCCAGGAAGC CAACCTAAAA 
GTAGCTATCA TTGTCTAGTT TGCTTTCAGA 
AGAAGCGGAG ACAGCGACGA AGCGCTCCTC 
CAAAGCAGTA AGTACACATA GTAGATGTAA 
ATTATAGATT AGGAGTAGGA GCATTGATAG 
CCATAGCATA TATAGAATAT AGGAAATTGG 
AAAGAATTAG GGAAAGAGCA GAAGACAGTG 
TGTCAACAAT GGTGGATATG GGGCATCTTA 
GAGGACTTGT GGGTCACAGT CTACTATGGG 



FIGURE IC 

6361 GTACCTGTGT GGAGAGAAGC AAAAACTACT CTATTCTGTG CATCAGATGC TAAAGCATAT 
6421 GAGACAGAAG TGCATAATGT CTGGGCTACA CATGCTTGTG TACCCACAGA CCCCAACCCA 
6481 CAAGAAATAG TTTTGGGAAA TGTAACAGAA AATTTTAATA TGTGGAAAAA TAACATGGCA 
6541 GATCAGATGC ATGAGGATAT AATCAGTTTA TGGGATCAAA GCCTAAAGCC ATGTGTAAAG 
6601 TTGACCCCAC TCTGTGTCAC TTTAAACTGT ACAGATACAA ATGTTACAGG TAATAGAACT 
6661 GTTACAGGTA ATACAAATGA TACCAATATT GCAAATGCTA CATATAAGTA TGAAGAAATG 
6721 AAAAATTGCT CTTTCAATGC AACCACAGAA TTAAGAGATA AGAAACATAA AGAGTATGCA 
6781 CTCTTTTATA AACTTGATAT AGTACCACTT AATGAAAATA GTAACAACTT TACATATAGA 
6841 TTAATAAATT GCAATACCTC AACCATAACA CAAGCCTGTC CAAAGGTCTC TTTTGACCCG 
6901 ATTCCTATAC ATTACTGTGC TCCAGCTGAT TATGCGATTC TAAAGTGTAA TAATAAGACA 
6961 TTCAATGGGA CAGGACCATG TTATAATGTC AGCACAGTAC AATGTACACA TGGAATTAAG 
7021 CCAGTGGTAT CAACTCAACT ACTGTTAAAT GGTAGTCTAG CAGAAGAAGG GATAATAATT 
7081 AGATCTGAAA ATTTGACAGA GAATACCAAA ACAATAATAG TACATCTTAA TGAATCTGTA 
7141 GAGATTAATT GTACAAGGCC CAACAATAAT ACAAGGAAAA GTGTAAGGAT AGGACCAGGA 
7201 CAAGCATTCT ATGCAACAAA TGACGTAATA GGAAACATAA GACAAGCACA TTGTAACATT 
7261 AGTACAGATA GATGGAATAA AACTTTACAA CAGGTAATGA AAAAATTAGG AGAGCATTTC 
7321 CCTAATAAAA CAATAAAATT TGAACCACAT GCAGGAGGGG ATCTAGAAAT TACAATGCAT 
7381 AGCTTTAATT GTAGAGGAGA ATTTTTCTAT TGCAATACAT CAAACCTGTT TAATAGTACA 
7441 TACTACCCTA AGAATGGTAC ATACAAATAC AATGGTAATT CAAGCTTACC CATCACACTC 
7501 CAATGCAAAA TAAAACAAAT TGTACGCATG TGGCAAGGGG TAGGACAAGC AATGTATGCC 
7561 CCTCCCATTG CAGGAAACAT AACATGTAGA TCAAACATCA CAGGAATACT ATTGACACGT 
7621 GATGGGGGAT TTAACAACAC AAACAACGAC ACAGAGGAGA CATTCAGACC TGGAGGAGGA 
7 681 GATATGAGGG ATAACTGGAG AAGTGAATTA TATAAATATA AAGTGGTAGA AATTAAGCCA 
7741 TTGGGAATAG CACCCACTAA GGCAAAAAGA AGAGTGGTGC AGAGAAAAAA AAGAGCAGTG 
7801 GGAATAGGAG CTGTGTTCCT TGGGTTCTTG GGAGCAGCAG GAAGCACTAT GGGCGCAGCG 
78 61 TCAATAACGC TGACGGTACA GGCCAGACAA CTGTTGTCTG GTATAGTGCA ACAGCAAAGC 
7921 AATTTGCTGA AGGCTATAGA GGCGCAACAG CATATGTTGC AACTCACAGT CTGGGGCATT 
7981 AAGCAGCTCC AGGCGAGAGT CCTGGCTATA GAAAGATACC TAAAGGATCA ACAGCTCCTA 
8041 GGGATTTGGG GCTGCTCTGG AAGACTCATC TGCACCACTG CTGTGCCTTG GAACTCCAGT 
8101 TGGAGTAATA AATCTGAAGC AGATATTTGG GATAACATGA CTTGGATGCA GTGGGATAGA 
8161 GAAATTAATA ATTACACAGA AACAATATTC AGGTTGCTTG AAGACTCGCA AAACCAGCAG 
8221 GAAAAGAATG AAAAAGATTT ATTAGAATTG GACAAGTGGA ATAATCTGTG GAATTGGTTT 
8281 GACATATCAA ACTGGCTGTG GTATATAAAA ATATTCATAA TGATAGTAGG AGGCTTGATA 
8341 GGTTTAAGAA TAATTTTTGC TGTGCTCTCT ATAGTGAATA GAGTTAGGCA GGGATACTCA 
8401 CCTTTGTCAT TTCAGACCCT TACCCCAAGC CCGAGGGGAC TCGACAGGCT CGGAGGAATC 
84 61 GAAGAAGAAG GTGGAGAGCA AGACAGAGAC AGATCCATAC GATTGGTGAG CGGATTCTTG 
8521 TCGCTTGCCT GGGACGATCT GCGGAGCCTG TGCCTCTTCA GCTACCACCG CTTGAGAGAC 
8581 TTCATATTAA TTGCAGTGAG GGCAGTGGAA CTTCTGGGAC ACAGCAGTCT CAGGGGACTA 
8641 CAGAGGGGGT GGGAGATCCT TAAGTATCTG GGAAGTCTTG TGCAGTATTG GGGTCTAGAG 
8701 CTAAAAAAGA GTGCTATTAG TCCGCTTGAT ACCATAGCAA TAGCAGTAGC TGAAGGAACA 
8761 GATAGGATTA TAGAATTGGT ACAAAGAATT TGTAGAGCTA TCCTCAACAT ACCTAGGAGA 
8821 ATAAGACAGG GCTTTGAAGC AGCTTTGCTA TAAAATGGGA GGCAAGTGGT CAAAACGCAG 
8881 CATAGTTGGA TGGCCTGCAG TAAGAGAAAG AATGAGAAGA ACTGAGCCAG CAGCAGAGGG 
8941 AGTAGGAGCA GCGTCTCAAG ACTTAGATAG ACATGGGGCA CTTACAAGCA GCAACACACC 
9001 TGCTACTAAT GAAGCTTGTG CCTGGCTGCA AGCACAAGAG GAGGACGGAG ATGTAGGCTT 
9061 TCCAGTCAGA CCTCAGGTAC CTTTAAGACC AATGACTTAT AAGAGTGCAG TAGATCTCAG 
9121 CTTCTTTTTA AAAGAAAAGG GGGGACTGGA AGGGTTAATT TACTCTAGGA AAAGGCAAGA 
9181 AATCCTTGAT TTGTGGGTCT ATAACACACA AGGCTTCTTC CCTGATTGGC AAAACTACAC 
9241 ATCGGGGCCA GGGGTCCGAT TCCCACTGAC CTTTGGATGG TGCTTCAAGC TAGTACCAGT 
9301 TGACCCAAGG GAGGTGAAAG AGGCCAATGA AGGAGAAGAC AACTGTTTGC TACACCCTAT 
9361 GAGCCAACAT GGAGCAGAGG ATGAAGATAG AGAAGTATTA AAGTGGAAGT TTGACAGCCT 
9421 TCTAGCACAC AGACACATGG CCCGCGAGCT ACATCCGGAG TATTACAAAG ACTGCTGACA 



FIGURE ID 

9481 CAGAAGGGAC TTTCCGCCTG GGACTTTCCA 

9541 GGGACTTGGG AGTGGTCACC CTCAGATGCT 

9601 GGGTCTCTCT CGGTAGACCA GATCTGAGCC 

9661 CTGCTTAGGC CTCAATAAAG CTTGCCTTGA 

9721 TGTGACTCTG GTAACTAGAG ATCCCTCAGA 

9781 A 



CTGGGGCGTT CCGGGAGGTG TGGTCTGGGC 
GCATATAAGC AGCTGCTTTT CGCTTGTACT 
TGGGAGCTCT CTGGCTATCT AGGGAACCCA 
GTGCTCTAAG TAGTGTGTGC CCATCTGTTG 
CCCTTTGTGG TAGTGTGGAA AATCTCTAGC 



Figure 2A 

I : indicates the regions for p-sheet and V\fW2 loop deletions 

*: is the N-linked glycosylation sites for subtype C TVl and TV2. Possible 
mutation (N-^ Q) or deletions can be performed. 



B-SFl 62 ( 1 ) MDA^CKRGLCCVLLLCGAVl^WSP-SAVEKLWV^/Y'lrGVPVWKEATTT 

C-TV1.8_2 (1) MRVMGTQkNCQQWWIWGILGFWMLMI-CNTEDLWTVYYGVPVWRDAKTT 

C-TV1.8_5 (1) MRVMGTQKNCQQWWIWGILGFWMLMI-CNTEDLWTVYYGVPVWREAKTT 

C-TV2. 12-5/1 (1) ^1RARGILkNYRHWWIWGILGFWMLMM-C^A«GtWTVYYGVPVGRE■AkTT 

C -M J4 ( 1 ) MRVKG I PRNWQQWWIWGS LGFWI I C— SVMGNL WTv'^r Y GV PVWRE AKTT 

IndiaC- 93IN1 01 ( 1 ) mrvrgtlrnyqqwwiwgvlgfwmlmicngggnlwvtvyygypvwkeaktt 

A-Q2317 (1) MRVMGIQRNCQHLLTWGIMILGTIIFCSAVENLWVTVYYGVP'v/WRPADTT 

D- 92UG0 0 1 (1) MRVREIERNYLCLWRWGIMLLGMLMTYSVAEKKWTW YGVPVWKEATTT 

E-cm235 (1) MDAtKRGLCqVLLIiCGAVFySP-SASWNLWVTVYYGyPVWRDADTT 

Consensus (1) MRV G RN Q WWiwGILGFWMLM S E LWVTVYYGVPVWREAKTT 



B-SF162 (46) LFCASDAKAYDTEVHNVWATHACVPTDPNPQEiyLENVTENFNl-BJKNNMV 
C-TVl . 8_2 (50) LFCASDAKAYETEVHNWJATHACVPTDPNPQEIVLGNVTENFNMWKNDMA 
C-TVl . 8_5 (50) LFCASDAKAYETEVHNVWATHACVPTDPNPQEI^iLGNVTENFNMWKNNt-IA 
C-TV2. 12-5/1 (50) LFCASDAKAYEKEVHimJATHACVPTDPNPQEjpiLGNVTENFNMWKNDMV 
C-MJ 4 (49) LFCASDAKAYEAEVHNVWATHACVPTDFNPQEIELKNVTENFtTOWENDMV 
IndiaC-93IN101 (51) LLCASDAKAYEREVHNVWATHACYPTDPHPQEIVLGWVTENFNMWKNDMV 
A-Q2 317 (51) LFCASDAKAYETEKHIW-JATHACVPTDPNPQEIHLDNVTEKFNMWKNNMV 

D-92UG001 (51) LFCASDAKSYKTEVHNlWATHftCVFTDPNPREIELENVTENFNMWKtWlV 

E-cm2 3 5 (46) LFCA SDAKAMETEVHhJVWATHACV P T B FN PQE I HLENV TEN FNMWKNNMV 

Consensus (51) LFCASDAKAYETEVHNVWATHACVPTDPNPQEIVL NVTENFNMWKNNMV 

, P2/V1V2/63 
101 I * * * * *150 

B-SF162 (96) EQMHEDIISLWDQSLKPCVKLTPLCVTLHCTNLKNATN TKSS 

C-TVl , 8_2 (100) DQ.MHEDYI 3LWDQ.SLKPCVKLTPLCVTLNCTDTNVTGNRTVTGNSTNNTN 

C-TVl . 8_5 (100) DQMHEDIISLKDQSLKPCVKLTPLCVTLNCTDTNVTGNRTVTGNTNDTNI 

C-TV2 . 12-5/1 (100) DQMQEDIISLWDQSLKPCVKLTPLCVTLNCTNATVNYN NTS— 

C-MJ 4 (99) DQHHEDI I S LWDQSLKPCVKLTPLCVTLNCKNVTSKDI NI 

IndiaC-93IN101 (101) DQS-iHEDVISLWDQSLKFCVKLTPLCVTLECRNVSRNVS SY 

A-Q2317 (101) EQMHTDIISLWDQSLKPCVKLTPLCVTLHCTNVTSVNT 

D-92UG001 (101) EQMHEDIISLWDQSLKPCVKLTPLCVTLNCTDARRNET RNNIT 

E-Cm235 (96) EQMQEDVISLWDQSLKPCVKLTPLCVTLNCTNAKLTNV NNITSVS 

Consensus (101) DQMHEDIISLWDQSLKPCVKLTPLCVTLNCTN 



151 * * 200 

(138) NWKEMDRGEtKNCSFWTSlRNKMQKEYALFmDWPXDN TMg 

(150) G'i:GIYNljEEMKHCSFNATTELRDKKHKEYALFYRLD'lVPLN---ENSDNF 

(150) MATYKyIeEMKNCSFNATTELRDKKHKEYALFYIcLDIVPLN ENSNNF 

(141) KDMKNCSFYVTTELRDKKKKENALFYRiDIVPLNNR-KNGNIN 

(139) TSNAEMKAEMKWCSFNITTELRDKKKQEYALFYKLDIVPLTNDNASENAS 
(141) t^PYNGS\^EjKNCSFNATPEyRDRKQRMYALFYGLDIVPLNKKNSSENSS 

X. (139) -3:G--DI^GEK>1C3FM>1TTELRDKRW NQGS 

D-92UG001 (144) . GMENNDQiEMKNCSFNITTKLIDKKKQVHALFYfoDlS^/QIDNDTSN&NYS 
E-cm235 (141) NTIGNIT;DEVroJCSFNMTTELRDKKQKV!HALFYRLDIVPl:ED---NKTSS 
Consensus (151) T EEMKNCSFNITTELRDKK KEYALFYKLDIVPLN N N S 



B-SF162 
C-TVl . 8_2 
C-TVl . 8_5 
C-TV2. 12-5/1 
C-MJ4 
IndiaC-93IN101 
A-Q2317 



Figure 2B 

201 * ^ * * 250 

B-SF162 (183) SYKLIMCNTSVITQACPKVSFEPIPIHYCAPAGFAILKCNDKKFNGSGPC 
C-TVl . 8_2 (197) TYRLINCNTSTITQACPKVSFDPIPIHi'CAPAGYAILKCNNKTFNGTGPC 
C-TVl . 8_5 (197) TYRLINCNTSTITQACPKVSFDPIPIHYCAPADYAILKCNNKTFNGTGPC 
C-TV2 . 1 2 -5 / 1 (183) NYRL I NCNTSAITQACPKVSFDP I P I HYCAPAGYAPLKCNNKKFNG IGPC 
C-MJ4 (189) EYRLINCDTSTITQSCPKVTFDPI PIHYCAFAGYVILKCNNKTFNGTGPC 
IndiaC-93IN101 (191) EYRLINCNTSAITQACPKVTFDPIPIHYCAP/aGYAILKCNNKTFMGTGPC 
A-Q23 1 7 (182) EYRLINCNTSAITQACPKVSFEPI PI HYCTPAGFAI LKCKDEGFNGTGLC 
D-92UG001 (194) NYRLINCNTSAITQACPKVTFEPIPIHYCAPAGFAILKCRDKKFNGTGPC 
E-cm235 (188) EYRLINCNTSVIKQACPKiiSFDPIPlHYCTPAGYArLKCNDKNFNGTaPC 
Consensus (201) YRLINCNTS ITQACPKVSFDPIPIHYCAPAGYAILKCNNK FNGTGPC 



B-SF162 
C-TVl . 8_2 
C-TVl . 8_5 
C-TV2. 12-5/1 
C-MJ4 
IndiaC-93lN101 
A-Q2317 
D-92UG001 
E-cm235 
Consensus 



251 * * 300 

(233) TNVSTVQCTHGIBPWSTQLLLNGSLAEEGVVIRSENFTDMAKTIIVQLK 
(247) YNVSTVQCTHGIKPWSTQLLLHGSLAEEGI I IR3ENLTENTKTHVHLN 
(247) YNVSTVgCTHGIKP'v'VSTQLLLNGSLAEEGIIIRSENLTENTKTIIVHLN 
(233) DNVSTVQCTHGIKPV^'STQLLLNGSLAEEEIIIRSENLTNKVKTIIVHLN 
(239) NNVSTVQCTHGIKPWSTQLLLNGSLAEKEIIIRSKNIiTDNVKTI IVHLN 
(241) NNVSTVQCTHGIKPVVSTQLLLNGSLAEGEI I IRSEHLTNNVKTI IVHLN 
(232) Kt^ySTVQCTHGIKPV,/STQLLLNGSLAEKNITIRSEMITNNAKIIIVQLV 
(244) KNVSTVQCTHGlSPVVSTQLLLNGSLAEEEIIiRSEHLTNKAKTlllVQLN 
(238) KNVS;SVQCTHGIKPWSTQLLLNGSLAEEEIIIk3EKLTNKAKTIIVHLN 
(251) NVSTVQCTHGIKPWSTQLLLNGSLAEEEIIIRSENLTNN KTIIVHLN 



301 



350 



B-SF162 (283) ESVEINCTRPN-NNTRKSITIGPGPAFYATGDIIGDIRQAHCNISGEkWN 

C-TVl . 8_2 (297) ESVEINCTKPN-^5NTRKSVRIGPGQAFYAT^iDVIGNIRQAH!:NISTDRv'}H 

C-TVl .8_5 (297) ESVEINCTRPN-NNTRKSVRIGPGQAFYATNDVIGNIRQAHCNISTDRWN 

C-TV2. 12-5/1 (283) ESIEIKCTRPG-NNTRKSVRIGPGQAFYATGDIIGDIRQAHCNISKNEWN 

C-MJ4 (289) ESVEIECTRPG-MNTRRSVRIGPGQAFYATGDIIGDIRAAHCNISESKi-JN 

IndiaC-93IN101 (291) QSVEIVCTRPN-KNTRKSIRIGPGQTFYATGDIIGDIRQAHCNISRDjfwN 

A-Q2317 (282) QPVT I KG IRPN-NNTRKSIRI GPGQAFYATGDI I GDIRQAHCNJfflRSRWN 

D-92UG001 (294) ESVEINCTRPYYNiaiRQRTSIGQGQALYTTR-OTGDIRKAil^CNfSKAGWN 

E-cm235 (288) KSVEINCTRPS-NNTRTSliTIGPGQVFYRTGDIIGDIRKA%EINGTKWN 

Consensus (301) ESVEINCTRPN NNTRKSIRIGPGQAFYATGDIIGDIRQAHCNIS KWN 



351 * *400 

B-SF162 (332) NTLKQIVTKLQAQFGNKT-IVFKQSSGGDPEIVMHSFNCGGEFFYCNSTQ 

C-TVl . 8_2 (346) KTLQQVMKKLGEHFPNKT-IQFKPHAGGDLEITMHSFNCRGEFFYC^3TSN 

C-TVl . 8_5 (346) KTLQQVMKfa.GEHFPNKT-IKFEPHAGGDLEITMHSFNCRGEFFYCHTSN 

C-TV2. 12-5/1 (332) TTLQRVSQKLQELFPNSTGIKFAPHSGGDLEITTHSFNCGGEFFYCNTTD 

C-MJ4 (338) KILYRVSEKLKEHFPNKT-IQFDQPIGGDLEITTHSFNCRGEFFYCNTSK 

IndiaC-93IN101 (340) ETLQRVGKKLAEHFHNKT-IKFASSSGGDLEITTHSFNCRGEFFYCNTSG 

A-Q2317 (331) KTLQEVAEKLRT^JFGNKT-IIFANSSGGDLEITTHSFNCGGEFFYCNTSG 

D-92UG001 (343) KTLQQVAKKLGB'lFNQTT-IIFKPSSGGDPEITTHSFNCGGEFFYCNTSK 

E-cm235 (337) EVLTQVTEKLKEHFNNKT-IIFQPPSGGDLEITMHHFNCRGEFFYCNTTR 

Consensus (351) KTLQQV KL EHF NKT I F P SGGDLEITTHSFNCRGEFFYCNTS 



Figure 2C 



B-SF162 
C-TVl . 8_2 
C-TV1.8_5 
C-TV2. 12-5/1 
C-MJ4 
IndiaC-93IN101 
A-Q2317 
D-92UG001 
E-cm235 
Consensus 



^01 * * I P20/P21 'Ir50 

(381) LFNSTWNN TIGPNN--TNGTITLPCR1KQHNRWQEVGKAMYAPP 

(395) LFMSTYHS— NNGTYKYNGNSSSPITLQCKIKQIVRMWQGVGQATYAPP 
(395) LFNSTYYP— KNGTYKYNGNSSLPITLQCKIKOIVPJi^WQGVGQAMYAPP 

(382) LFNSTYSNGTCTNGTCMSN— NTERITLQCRIKQIINMWQEVGRAMYAPP 

(387) LFMGTYNS TGDTSN STITLSCRIKQI INMWQGVGRAMVASP 

(389) LFNGTYMPTYMPNGTESNS— NSTITIPCRIKQIINMWQEVGRZiMYAPP 
(380) LFHSTWYVNSTWNDTDSTQ-ESNDTITLPCRIKQI INMWQRAGQAMYAPP 
(3 92 ) LF MSAWND-STWNIGNNNTGSDNETI IiIPCRI KQI INMWQGVGKAMYAPP 

(386) LFNNTCIE NGTMGGC— NGTIILPCKIKQIINMWQGAGQAMYAPP 

(401) LFNSTY NGT N N TITLPCRIKQIINMWQGVGRAMYAPP 



B-SF162 
C-TVl . 8_2 
C-TV1.8_5 
C-TV2. 12-5/1 
C-MJ4 
IndiaC-93IN101 
A-Q23n 
D-92UG001 
E-cm235 



451* * * * 

(424) IRGQIRCSSNITGLLLTRDGGKEISNT— TEIFRPGGGDMRDNWRSELY 

(442) lAGNITCRSNITGILLTRDGGFNTTNN TETFRPGGGDMRDMWRSELY 

(442) lAGNITCRSMITGILLTRDGGFNNtNkDT-EETFRPGGGDMRDNWRSELY 

(430) lAGNITCRSMITGLLLTRDGGDNNTET ETFRPGGGDMRDNWRSELY 

(428) IAGNITCK3MITGLLLTRDGGNETSGI EIFRPAGGDMRDMWRSSLY 

(436) I AGN I TCTSMITGLLLVHDGGIKENDTENKTEI FRPGGGDMRDNWRSELY 

(429) IPGVIKCE3NITGLLLTRDGGKDNNVN ETFRPGGGDMRDNWRSELY 

(441) lEGWINCASNITGLLLVRDGGGANDsb NETFRPQGGDMRDNWRSELY 

(429) IS:grinCVSNITGILLTRDGGAINT;TN' ETFRPGGGNi-KDNWRSELY 



Consensus (451) lAGNITC SNITGLLLTRDGG NT N ETFRPGGGDMRDNWRSELY 



B-SF162 
C-TVl . 8_2 
C-TVl . 8_5 
C-TV2. 12-5/1 
C-MJ4 
IndiaC-93IN101 
A-Q2317 
D-92UG001 
E-cm235 
Consensus 



501 550 
(471) KYKVVKIEPLGVAPTKAKRRyVQREKRAVTLG^yjiiFLGFLGAAGSTMGARS 
(489) KYKWEIKPLGrAPTKAKRRVVQREKRAVGIGAVFLGFLGAAGSTMGAAS 
(491) KYKVVEIKPLGIAPTKAKRRVVQRKKRAVGIGAVFLGFLGAAGSTMGAAS 
(476) KYKVVEIKPLGVAPTAAKRRWEREKRAVGIGAVFLGFLGAAGSTMG/iAS 

(474) KYKVVEIKPLGLAPTKSKRRV/EREKPJWTFGAMFLGFLGAAGSTMGAAS 
(4 86) KYKWEXKPLGVAPTAAKRRVVEREKRAVGIGAVFLGFLGAAGSTMGAAS 

(475) KYKVVEIEPLGVAPTRAKRRVVEREKRAVGIGAVFLGFLGAAGSTMGATf! 
(488) KYKVVKIEPLGIAPTKAKRRWEREKRAIGLGAMFLGFLGAAGS TMGAAS 
(475) KYKVVQIEPLG|APTRAKRRV^/EREKRAVGIGAMIFGFLGAAGSTMG7iAS 
(501 ) KYKWEIKPLGIAPTKAKRRWEREKRAVGIGAVFLGFLGAAGSTMGAAS 



551 



600 



B-SF162 (521) liTLTVQARQLLSGiyQQQNl-iLLRA.IEAOQHLLQLTVWGTKQLQARVLAVE 

C-TVl . 8_2 (539) ITLTVQARQLLSGiyQQQSNLLKAIEAQQHMLQLTVWGI KQLQARVLAIE 

C-TVl . 8_5 (541) ITLTVQARQLLSGIVQQQSNLLKAIEAQQHMLQLTWGIKQLQARVLA^IE 

C-TV2 . 12-5/1 (526) ITLTVQARQLLSGIVQQQSNLLRAlEAQQHMLQLTWGIKQLQARVLA'iE 

C-MJ4 (524) mtltvqarqllsgivqqosnllraieaqqhm'lqltvwgikqlqtrvlave 

indiac-93iNioi (536) itltaoarqllsgivqqqsnllraieaqqhllqltvwgikqlqtrvlaIe 

A-Q2 317 (525) ITLTVQAF.QLLSGI VQQQKHLLRAIEAQQHLLKLTVWGIKQLQARVLAVE 

D-92UG001 (538) LTLTVQARQLLSGIVQHQNNLLMAIEAQQHLLQLTVWGIKQLQARILAVE 

E-cm235 (525) ITLTVQARQLLSGIVQQQSNLLRAIEAQQHLLQLTVWGIKQLQARVLAVE 

Consensus (551) ITLTVQARQLLSGIVQQQSNLLRAIEAQQHLLQLTVWGIKQLQARVLAVE 
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(591) RYLKDQQI-LGIWGCSG'RLICTTAVPWNSSWSNKSEADIWDNMTWMQWDRE 
(576) RYLQDQQLLGLWGCSGKLICTTNVLWNSSWSKKTQSDIWDNMTWMQWDRE 

(574) RYLRDQQLLGIWGCSGKLICTTAVPKNSSW3KKSQHDIWDNLTWMQWDRE 
(586) RYLKDQQLLGIWGCSGKLICTTAVPWNSSWSNKTQSEIIi'rtJNMTWMQWDRE 

(575) RYLRDQQLLGIWGCSGKLICTTNVPWNSSW3NKSLDEIV/NWMTWLQWDKE 
(588) RYLQDQQLLGSWGCSGRHICTTTVPWNSSWSNKSIDDIWNNMTWMEWEKE 
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(621) IDNYTNfclYTLIEESQNQQEKNEQELLELDKWASLWNWFDISKWLWYIKI 

(639) I SWYTGJi I YNLLEDSQNQQEKNEKDLLELDKWNNLWNWFDI SNWPWY IK 1 

(641) INNYTETIFRLLEDSQNQQEKNEKDLLELDKWNNLWNWFDISNWLWYIKI 

(626) I SN YTNT I YRLLED30SnQER>i£KDLLALDRvfl}NLWNWFSia'NWLWYIKI 

(624) ismytdtiyrlleesqnqqernekdllaldswktlwswfdisnwlwyiki 
( 636) vsnytniiyslleesqnqqeknekdllaldswknlwswfdiitnwlwy iki 

( 625 ) inhytqiilyrlieesqnqqekhekelleldkwanlwswfdi snwlwyiki 
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(651) isnytnliyrlleesqnqqeknekdlleldkw nlwnwfdisnwlwyiki 
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B-SFl 62 (671) FIMIVGGLyGLRIVFTVLS IVNRVRQG YS PLSFQTRFPAPRGPDRPEGI E 

C-TVl , 8_2 (689) FIMIVGGIIGLBIIFAVLSIVHRVRQGYSPLSFQTLTPSPRGLDRLGGIE 

C-TVl , 8_5 (691) FIMIVGGLIGLRIIFAVLSIVNRVRQGYSPLSFQTLTPSPRGLDRLGGIE 

C-TV2 . 12-5/1 (676) FIMIVGGLIGLRIIFAVLSLVMRVRQGiSPLSLQTLIPNPRGPDRLGGIE 

C-MJ4 (674) FIMIVGSLIGLRIIFAVLSIVNRVRQGY3PLSFQTLTPNPRGPDRLEGIE 

IndiaC-93lN101 (686) FIMIVGGLIGLRIIFAVLSIVNRVRQGYSPLSFQTLTPNPRGPDPXGRIE 

A-Q2317 (675) FI IIVGGLIGLRiyFAVLSViroWQGYSPLSFQ'THTPNPRGLDRPERIE 

D-92UG001 (688) FIMIVGGLIGLRIVFTVLSitVNRVRQGYSPLSFQTLFPAPRGPDRPEEIE 

E-cm235 (675) FIMIIGGLIGLRIIFAVL3IVNRVP.QGYSPLSFQTPFHHQREPDRSERIE 

Consensus (701) FIMIVGGLIGLRIIFAVLSIVNRVRQGYSPLSFQTLTP PRGPDRLEGIE 
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(741) eeggeqdrdrsirlvsgfls'lawddlrslclfsyhrlrdfiliavravel 

(726) eeggeqdssrsirlvsgfltlawddlrslclfcyhrlrdfiliwravel 

(724) seggeqdkdrsiru^jgslalav/dplrslclfsyhqlrdfilvvaravel 
(736) eeggeqdkdrsirlvhgflalawddlrnlclpsyhrlrdfisvaarwel 

(725) eedgeqgrgrsirlvsgflalawddlrslclfsyhrlrdfiliaartveli 

(738) EGGGEQGRGRSTRLVNGFSTLIWDDLRHLCLFSYHRLRDLILIATRIVEL 

(725) EGGGEQGRDRSiVRLVSGFLALAWDDLRSLCLFSYHRLRDFILIAARTVKL 

(751) EEGGEQDRDRSIRLVSGFLALAWDDLRSLCLFSYHRLRDFILIAAR VEL 
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(786) LGRSS— WEALKYLGSLVQYWGLELKKSAISLFDSIAIWAEGTD 
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gpl40 .inodSF162 .delV2 



gaattcgccaccatggatgcaatgaagagagggctctgctgtgtgctgctgctgtgtggagcagtc 
ttcgtttcgcccagcgccgtggagaagctgtgggtgaccgtgtaccacggcgtgcccgtgtggaag 
gaggccaccaccaccctgttctgcgccagcgacgccaaggcctacgacaccgaggtgcacaacgtg 
tgggccacccacgcctgcgtgcccaccgaccccaacccccaggagatcgtgctggagaacgtgacc 
gagaacttcaacatgcggaagaacaacatggtggagcagacgcacgaggacatcatcagcctgtgg 
gaccagagcctgaagccctgcgtgaagctgacccccctgtgcgtgaccctgcactgcaccaacctg 
aagaacgccaccaacaccaagagcagcaactggaaggagacggaccgcggcgagatcaagaactgc 
agcttcaaggtgggcgccggcaagctgaccaactgcaacaccagcgcgaccacccaggcctgcccc 
aaggtgagccccgagcccatccccacccactactgcgcccccgccggccccgccatcccgaagtgc 
aacgacaagaagttcaacggcagcggcccctgcaccaacgtgagcaccgcgcagtgcacccacggc 
acccgccccgtggtgagcacccagccgctgctgaacggcagcctggccgaggagggcgtggcgacc 
cgcagcgagaacttcaccgacaacgccaagaccatcaccgcgcagccgaaggagagcgcggagatc 
aactgcacccgccccaacaacaacacccgcaagagcatcaccatcggccccggccgcgccttctac 
gccaccggcgacatcatcggcgacatccgccaggcccactgcaacatcagcggcgagaagtggaac 
aacaccctgaagcagatcgtgaccaagctgcaggcccagttcggcaacaagaccatcgtgttcaag 
cagagcagcggcggcgaccccgagatcgtgatgcacagcttcaaccgcggcggcgagttcttctac 
tgcaacagcacccagctgttcaacagcacctggaacaacaccatcggccccaacaacaccaacggc 
accatcaccctgccccgccgcatcaagcagatcatcaaccgccggcaggaggtgggcaaggccatg 
tacgccccccccatccgcggccagatccgctgcagcagcaacatcaccggcctgccgctgacccgc 
gacggcggcaaggagatcagcaacaccaccgagatctcccgccccggcggcggcgacatgcgcgac 
aactggcgcagcgagccgtacaagtacaaggtggtgaagatcgagcccccgggcgcggcccccacc 
aaggccaagcgccgcgtggtgcagcgcgagaagcgcgccgcgaccccgggcgccatgttcctgggc 
tccctgggcgccgccggcagcaccacgggcgcccgcagcctgacccccaccgcgcaggcccgccag 
ctgccgagcggcatcgtgcagcagcagaacaacctgctgcgcgccaticgaggcccagcagcacctg 
ctgcagctgaccgtgtggggcatcaagcagctgcaggcccgcgcgcccgccgtggagcgctacctg 
aaggaccagcagctgctgggcatctggggctgcagcggcaagctgacccgcaccaccgccgtgccc 
tggaacgccagctggagcaacaagagcctggaccagatctggaacaacatgacctggatggagtgg 
gagcgcgagatcgacaactacaccaacctgatccacaccctgatcgaggagagccagaaccagcag 
gagaagaacgagcaggagctgctggagctggacaagtgggccagcccccggaactggttcgacatc 
agcaagtggctgtggtacatctaactcgag 



Figure 6 



gpl40 .mut? .modSF162 .delV2 



gaattcgccaccatggatgcaatgaagagagggctctgctgtgtgctgctgctgtgtggagcagtc 
ttcgtttcgcccagcgccgtggagaagctgtgggcgaccgtgtactacggcgtgcccgtgtggaag 
gaggccaccaccaccctgttctgcgccagcgacgccaaggcctacgacaccgaggtgcacaacgtg 
tgggccacccacgcctgcgtgcccaccgaccccaacccccaggagaccgtgctggagaacgtgacc 
gagaacttcaacatgtggaagaacaacatggtggagcagacgcacgaggacatcatcagcctgtgg 
gaccagagcctgaagccctgcgtgaagctgacccccctgtgcgtgaccctgcactgcaccaacctg 
aagaacgccaccaacaccaagagcagcaactggaaggagacggaccgcggcgagatcaagaactgc 
agctccaaggtgggcgccggcaagctgatcaaccgcaacaccagcgtgatcacccaggcctgcccc 
aaggtgagcttcgagcccatccccatccactactgcgcccccgccggctccgccatcctgaagtgc 
aacgacaagaagttcaacggcagcggcccctgcaccaacgcgagcaccgcgcagtgcacccacggc 
atccgccccgtggtgagcacccagctgctgctgaacggcagcctggccgaggagggcgtggtgatc 
cgcagcgagaacttcaccgacaacgccaagaccatcatcgtgcagctgaaggagagcgtggagatc 
aactgcacccgccccaacaacaacacccgcaagagcatcaccatcggccccggccgcgccttctac 
gccaccggcgacatcatcggcgacatccgccaggcccactgcaacatcagcggcgagaagtggaac 
aacaccctgaagcagatcgtgaccaagctgcagccccagttcggcaacaagaccatcgtgttcaag 
cagagcagcggcggcgaccccgagatcgtgatgcacagcttcaactgcggcggcgagttcttctac 
tgcaacagcacccagctgttcaacagcacctggaacaacaccaccggccccaacaacaccaacggc 
accatcaccctgccctgccgcatcaagcagatcatcaaccgctggcaggaggtgggcaaggccatg 
tacgccccccccatccgcggccagatccgctqcagcagcaacatcaccggcctgctgctgacccgc 
gacggcggcaaggagatcagcaacaccaccgaga-cttccgccccggcggcggcgacatgcgcgac 
aactggcgcagcgagccgcacaagtacaaggtggzcaagaccgagcccctgggcgtggcccccacc 
aaggccatcagcagcgcggcgcagagcgagaagaccgccgzgaccctgggcgccatgttcctgggc 
ttcccgggcgccgccggcagcaccatgggcgcccccagcctgaccccgaccgtgcaggcccgccag 
ctgctgagcggcaccgtgcagcagcagaacaacc-gctgcgcgccaccgaggcccagcagcacctg 
ctgcagctgaccgtgtggggcatcaagcagctgcaggcccgcgtgctggccgtggagcgctacctg 
aaggaccagcagctgctgggcatctggggctgcagcggcaagctgatccgcaccaccgccgtgccc 
tggaacgccagctggagcaacaagagcctggaccagacctggaacaacacgacccggatggagtgg 
gagcgcgagatcgacaactacaccaacctgatctacaccctgatcgaggagagccagaaccagcag 
gagaagaacgagcaggagctgctggagctggacaaccgggccagcctgtggaactggctcgacatc 
agcaagtggctgcggtacatctaactcgag 



Figure 7 



gpl4 Omod . TVl . delV2 



1 gaattcatgc gcgtgatggg cacccagaag 
61 ctgggcttct ggatgctgat gatctgcaac 
121 ggcgtgcocg tgtggcgcga cgccaagacc 
181 tacgagaccg aggtgcacaa cgtgtgggcc 
241 ccccaggaga tcgtgctggg caacgtgacc 
301 gccgaccaga tgcacgagga cgtgatcagc 
361 aagctgaccc ccctgtgcgt gaccctgaac 
421 accgtgaccg gcaacagcac caacaacacc 
481 atgaagaact gcagcttcaa cgccggcgcc 
541 atcacccagg cctgccccaa ggtgagcttc 
601 gccggctacg ccatcctgaa gtgcaacaac 
661 aacgtgagca ccgtgcagtg caccoacggc 
721 ctgaacggca gcctggccga ggagggcatc 
781 accaagacca tcatcgtgca cctgaacgag 
841 aacaacaccc gcaagagcgt gcgcatcggc 
901 gtgatcggca acatccgcca ggcccactgc 
961 ctgcagoagg tgatgaagaa gctgggcgag 
1021 coccacgccg gcggcgacct ggagatcacc 
1081 ttctactgca acaccagcaa cctgttcaac 
1141 aagtacaacg gcaacagcag cagccccato 
1201 cgcatgtggc agggcgtggg ccaggccacc 
1261 tgccgcagca acatcaccgg catcctgctg 
1321 aacaccgaga ccttccgccc cggcggcggc 
1381 tacaagtaca aggtggtgga gatcaagccc 
1441 cgcgtggtgc agcgcgagaa gcgcgccgtg 
ISOl ggcgccgccg gcagcaccat gggcgcegcc 
1561 ctgctgagcg gcatcgtgca gcagcagagc 
1621 cacatgctgc agctgaccgt gtggggcatc 
1681 gagcgctacc tgaaggacca gcagctgctg 
1741 tgcaccaccg ccgtgccctg gaacagcagc 
1801 gacaacatga cctggatgca gtgggaccgc 
1861 aacctgctgg aggacagcca gaaccagcag 
1921 gacaagtgga acaacctgtg gaactggttc 
1981 ctcgag 



aactgccagc agtggtggat ctggggcatc 
accgaggacc tgtgggtgac cgtgtactac 
accctgttct gcgccagcga cgccaaggcc 
acccacgcct gcgtgcccac cgaccccaac 
gagaacttca acatgtggaa gaacgacatg 
ctgtgggacc agagcctgaa gccctgogtg 
tgcaccgaca ccaacgtgac cggcaaocgc 
aacggcaocg gcatctacaa catcgaggag 
ggccgcctga tcaactgcaa caccagcacc<i 
gaccccatcc ccatccacta ctgcgccccc 
aagaccttca acggcaccgg cccctgctac 
atcaagcccg tggtgagcac ccagctgctg 
atcatccgca gegagaacct gaccgagaac 
agcgtggaga tcaactgcac ccgccccaac 
cccggccagg ccttctacgc caccaacgac 
aacatcagca ccgaccgctg gaacaagacc 
cacttcccca acaagaccat ccagttcaag 
atgcacagct tcaactgccg cggcgagttc 
agcacctaco acagcaacaa cggcacctac 
accctgcagt gcaagatcaa gcagatcgtg 
tacgcccccc ccatcgccgg caacatcacc 
acccgcgacg gcggcttcaa caccaccaac 
gacatgcgcg acaactggcg cagegagctg 
ctgggcatcg cccccaccaa ggccaagcgc 
ggcatcggcg ecgtgttcct gggcttcctg 
agcatcaccc tgaccgtgca ggcccgccag 
aacctgctga aggccatcga ggcccagcag 
aagcagctgc aggcccgcgt gctggccatc 
ggcatctggg gctgoagcgg ccgcctgatc 
tggagcaaca agagcgagaa ggacatctgg 
gagatcagca actacaccgg cctgatctac 
gagaagaacg agaaggacct gctggagctg 
gacatcagca acbggccctg gtacatcibaa 



Figure 8 



gpl40mod.TVl.tout7.delV2 



1 gaattcatgc gcgtgatggg cacccagaag aactgccagc agtggtggat ctggggcatc 
61 ctgggcttct ggatgctgat gatctgcaac accgaggacc tgtgggtgac cgtgtactac 
121 ggcgtgcccg tgtggcgcga cgccaagacc accctgttct gcgccagcga cgccaaggcc 
181 tacgagaccg aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac 
241 ccccaggaga tcgtgctggg caacgtgacc gagaacttca acatgtggaa gaacgacatg 
301 gccgaccaga tgcacgagga cgtgatcagc ctgtgggacc agagcctgaa gccctgcgtg 
361 aagctgaccc ccctgtgcgt gaccctgaac tgcaccgaca ccaacgtgac cggcaaccgc 
421 accgtgaccg gcaacagcac caacaacacc aacggcaccg gcatctacaa catcgaggag 
481 atgaagaact gcagcttcaa cgccggcgcc ggccgcctga tcaactgcaa caccagcacc 
541 atcacccagg cctgccccaa ggtgagcttc gaccccatcc ccatccacta ctgcgccccc 
601 gccggctacg ccatcctgaa gtgcaacaac aagaccttca acggcaccgg cccctgctac 
661 aacgtgagca ccgtgcagtg cacccacggc atcaagcccg tggtgagcac ccagctgctg 
721 ctgaacggca gcctggccga ggagggcatc atcatccgca gcgagaacct gaccgagaac 
781 accaagacca toatcgtgca cctgaacgag agcgtggaga tcaactgcac ccgccccaac 
841 aacaacaccc gcaagagcgt gcgcatcggc cccggccagg ccttctacgc caccaacgac 
901 gtgatcggca acatccgcca ggcccactgc aacatcagca ccgaccgctg gaacaagacc 
961 ctgcagcagg tgatgaagaa gctgggcgag cacttcccca acaagaccat ccagttcaag 
1021 ccccacgccg gcggcgacct ggagatcacc atgcacagct tcaactgccg cggcgagttc 
1081 ttctactgca acaccagcaa cctgttcaac agcacctacc acagcaacaa cggcacctac 
1141 aagtacaacg gcaacagcag cagccccatc accctgcagt gcaagatcaa gcagatcgtg 
1201 cgcatgtggc agggcgtggg ccaggccacc tacgcccccc ccatcgccgg caacatoacc 
1261 tgccgcagca acatcaccgg catcctgctg acccgcgacg gcggcttcaa caccaccaac 
1321 aacaccgaga ccttccgccc cggcggcggc gacatgcgcg acaactggcg cagcgagctg 
1381 tacaagtaca aggtggtgga gatcaagccc ctgggcatcg cccccaccaa ggccatcagc 
1441 agcgtggtgc agagcgagaa gagcgccgtg ggcatcggcg ccgtgttcct gggcttcctg 
1501 ggcgccgccg gcagcaccat gggcgccgcc agcatcaccc tgaccgtgca ggcccgccag 
1561 ctgctgagcg gcatcgtgca gcagcagagc aacctgctga aggccatcga ggcccagcag 
1621 cacatgctgc agctgaccgt gtggggcatc aagcagctgc aggcccgcgt gctggccatc 
1681 gagcgctacc tgaaggacca gcagctgctg ggcatctggg gctgcagcgg ccgcctgatc 
1741 tgcaccaccg ccgtgccctg gaacagcagc tggagcaaca agagcgagaa ggacatctgg 
1801 gacaacatga cctggatgca gtgggaccgc gagatcagca actacaccgg cctgatctac 
1861 aacctgctgg aggacagcca gaaccagcag gagaagaacg agaaggacct gctggagctg 
1921 gacaagtgga acaacctgtg gaactggttc gacatcagca actggccctg gtacatotaa 
1981 ctogs^ 



Figure 9 



FIGURE 10 

gpl60mod.Q23-17 

1 ATGCGCGTGA TGGGCATCCA 
61 CTGGGCACCA TCATCTTCTG 
121 GTGCCCGTGT GGCGCGACGC 
181 GAGACCGAGA AGCACAACGT 
241 CAGGAGATCC ACCTGGACAA 
301 GAGCAGATGC ACACCGACAT 
361 CTGACCCCCC TGTGCGTGAC 
421 GACCGCGAGG GCCTGAAGAA 
481 CA6AAGGTGT ACAGCCTGTT 
541 AGCGAGTACC GCCTGATCAA 
601 AGCTTCGAGC CCATCCCCAT 
661 AAGGACGAGG GCTTCAACGG 
721 CACGGCATCA AGCCCGTGGT 
781 AACATCACCA TCCGCAGCGA 
841 GTGCAGCCCG TGACCATCAA 
901 ATCGGCCCCG GCCAGGCCTT 
961 CACTGCAACG TGACCCGCAG 
1021 CGCACCTACT TCGGCAACAA 
1081 ATCACCACCC ACAGCTTCAA 
1141 TTCAACAGCA CCTGGTACGT 
1201 AACGACACCA TCACCCTGCC 
1261 GGCCAGGCCA TGTACGCCCC 
1321 GGCCTGCTGC TGACCCGCGA 
1381 GGCGGCGGCG ACATGCGCGA 
1441 ATCGAGCCCC TGGGCGTGGC 
1501 CGCGCCGTGG GCATCGGCGC 
1561 GGCGCCACCA GCATCACCCT 
1621 CAGCAGAACA ACCTGCTGCG 
1681 TGGGGCATCA AGCAGCTGCA 
1741 CAGCTGCTGG GCATCTGGGG 
1801 AACAGCAGCT GGAGCAACAA 
1861 TGGGACAAGG AGATCAACAA 
1921 AACCAGCAGG AGAAGAACGA 
1981 AGCTGGTTCG ACATCAGCAA 
2041 GGCCTGATCG GCCTGCGCAT 
2101 GGCTACAGCC CCCTGAGCTT 
2161 GAGCGCATCG AGGAGGAGGA 
2221 GGCTTCCTGG CCCTGGCCTG 
2281 CTGCGCGACT TCATCCTGAT 
2341 AAGGGCCTGC GCCTGGGCTG 
2401 GGCCGCGAGC TGAAGATCAG 
2461 GGCTGGACCG ACCGCGTGAT 
2521 CCCGTGCGCA TCCGCCAGGG 



GCGCAACTGC CAGCACCTGC 
CAGCGCCGTG GAGAACCTGT 
CGACACCACC CTGTTCTGCG 
GTGGGCCACC CACGCCTGCG 
CGTGACCGAG AAGTTCAACA 
CATCAGCCTG TGGGACCAGA 
CCTGCACTGC ACCAACGTGA 
CTGCAGCTTC AACATGACCA 
CTACCGCCTG GACATCGTGC 
CTGCAACACC AGCGCCATCA 
CCACTACTGC ACCCCCGCCG 
CACCGGCCTG TGCAAGAACG 
GAGCACCCAG CTGCTGCTGA 
GAACATCACC AACAACGCCA 
GTGCATCCGC CCCAACAACA 
CTACGCCACC GGCGACATCA 
CCGCTGGAAC AAGACCCTGC 
GACCATCATC TTCGCCAACA 
CTGCGGCGGC GAGTTCTTCT 
GAACAGCACC TGGAACGACA 
CTGCCGCATC AAGCAGATCA 
CCCCATCCCC GGCGTGATCA 
CGGCGGCAAG GACAACAACG 
CAACTGGCGC AGCGAGCTGT 
CCCCACCCGC GCCAAGCGCC 
CGTGTTCCTG GGCTTCCTGG 
GACCGTGCAG GCCCGCCAGC 
CGCCATCGAG GCCCAGCAGC 
GGCCCGCGTG CTGGCCGTGG 
CTGCAGCGGC AAGCTGATCT 
GAGCCTGGAC GAGATCTGGA 
CTACACCCAG CTGATCTACC 
GAAGGAGCTG CTGGAGCTGG 
CTGGCTGTGG TACATCAAGA 
CGTGTTCGCC GTGCTGAGCG 
CCAGACCCAC ACCCCCAACC 
CGGCGAGCAG GGCCGCGGCC 
GGACGACCTG CGCAGCCTGT 
CGCCGCCCGC ACCGTGGAGC 
GGAGGGCATC AAGTACCTGT 
CGCCATCAAC CTGGTGGACA 
CGAGATCGCC CAGCGCATCG 
CCTGGAGCGC GCCCTGCTGT 



TGACCTGGGG CATCATGATC 
GGGTGACCGT GTACTACGGC 
CCAGCGACGC CAAGGCCTAC 
TGCCCACCGA CCCCAACCCC 
TGTGGAAGAA CAACATGGTG 
GCCTGAAGCC CTGCGTGAAG 
CCAGCGTGAA CACCACCGGC 
CCGAGCTGCG CGACAAGCGC 
CCATCAACGA GAACCAGGGC 
CCCAGGCCTG CCCCAAGGTG 
GCTTCGCCAT CCTGAAGTGC 
TGAGCACCGT GCAGTGCACC 
ACGGCAGCCT GGCCGAGAAG 
AGATCATCAT CGTGCAGCTG 
ACACCCGCAA GAGCATCCGC 
TCGGCGACAT CCGCCAGGCC 
AGGAGGTGGC CGAGAAGCTG 
GCAGCGGCGG CGACCTGGAG 
ACTGCAACAC CAGCGGCCTG 
CCGACAGCAC CCAGGAGAGC 
TCAACATGTG GCAGCGCGCC 
AGTGCGAGAG CAACATCACC 
TGAACGAGAC CTTCCGCCCC 
ACAAGTACAA GGTGGTGGAG 
GCGTGGTGGA GCGCGAGAAG 
GCGCCGCCGG CAGCACCATG 
TGCTGAGCGG CATCGTGCAG 
ACCTGCTGAA GCTGACCGTG 
AGCGCTACCT GCGCGACCAG 
GCACCACCAA CGTGCCCTGG 
ACAACATGAC CTGGCTGCAG 
GCCTGATCGA GGAGAGCCAG 
ACAAGTGGGC CAACCTGTGG 
TCTTCATCAT CATCGTGGGC 
TGATCAACCG CGTGCGCCAG 
CCCGCGGCCT GGACCGCCCC 
GCAGCATCCG CCTGGTGAGC 
GCCTGTTCAG CTACCACCGC 
TGCTGGGCCA CAGCAGCCTG 
GGAACCTGCT GAGCTACTGG 
CCATCGCCAT CGCCGTGGCC 
GCCGCGCCAT CCTGCACATC 
AA 



FIGURE 11 

gpl60mod.98UA0116 



1 ATGAAGGCCC GCGGCATGCA GCGCAACTAC 
61 TTCTGGATGA TCATCATGTG CAAGGCCGCC 
121 GTGCCCGTGT GGCGCGACGC CGAGACCACC 
181 GACAAGGAGG TGCACAACGT GTGGGCCACC 
241 CAGGAGATCA TCCTGGAGAA CGTGACCGAG 
301 GAGCAGATGC AGACCGACAT CATCAGCCTG 
361 CTGACCCCCC TGTGCGTGAC CCTGAACTGC 
421 AACAGCAACA GCAACGACAA CTGGAGCGAG 
481 ACCGAGCTGC GCGACAAGCG CAAGACCGTG 
541 AGCACCGGCA GCAACGACAG CCGCCAGTAC 
601 ACCCAGGCCT GCCCCAAGGT GACCTTCGAG 
661 GGCTTCGCCA TCCTGAAGTG CAAGGACACC 
721 GTGAGCACCG TGCAGTGCAC CCACGGCACC 
781 AACGGCAGCC TGGCCGAGAA GGAGGTGATG 
841 AAGATCATCA TCGTGCAGCT GACCGAGCCC 
901 AACAAGCGCA CCAGCATCCG CATCGGCCCC 
961 ATCGGCGACA TCCGCAAGGC CTACTGCAAC 
1021 CAGAAGATCA GCACCCAGCT GCGCCAGTAC 
1081 AGCAGCGGCG GCGACCTGGA GGTGACCACC 
1141 TACTGCAACA CCACCGACCT GTTCAACAGC 
1201 AGCACCATGG CCAACGGCAC CATCACCCTG 
1261 TGGCAGCGCG TGGGCCAGGC CATGTACGCC 
1321 AGCAACATCA CCGGCCTGCT GCTGACCCGC 
1381 GAGACCTACC GCCCCATCGG CGGCAACATG 
1441 TACAAGGTGG TGAAGATCGA GCCCATCGGC 
1501 GTGGAGCGCG AGAAGCGCGC CATCGGCCTG 
1561 GCCGGCAGCA CCATGGGCGC CGCCAGCATG 
1621 AGCGGCATCG TGCAGCAGCA GAGCAACCTG 
1681 CTGAAGCTGA CCGTGTGGGG CATCAAGCAG 
1741 TACCTGAAGG ACCAGCAGCT GCTGGGCATC 
1801 ACCAACGTGC CCTGGAACAG CAGCTGGAGC 
1861 ATGACCTGGA TGCAGTGGGA CCGCGAGGTG 
1921 ATCGAGGAGA GCCAGAACCA GCAGGAGAAG 
1981 TGGGCCAGGC TGTGGAGCTG GTTCGACATC 
2041 ATCATCATCG TGGGCGGCCT GATCGGCCTG 
2101 AACCGCGCCG GCCAGGGCTA CAGCCCCCTG 
2161 GGCCCCGACC GCCCCGGCCG CATCAAGGAG 
2221 ATCCGCCTGG TGAGCGGCTT CCTGGCCCTG 
2281 TTCAGCTACC GCCGCCTGCG CGACTTCATC 
2341 GGCCGCAGCA GCCTGAAGGG CCTGCGCCTG 
2401 CTGCTGGGCT ACCGCGGCCA GGAGCTGAAG 
2461 GCCATCGCCG TGGCCGGCTG GACCGACCGC 
2521 GCCATCCGCA ACATCCCCCG CCGCATCCGC 



CAGCACCTGT GGCGCTGGGG CACCATGCTG 
GAGAACCTGT GGGTGACCGT GTACTACGGC 
CTGTTCTGCG CCAGCGACGC CAAGGCCTAC 
CACGCCTGCG TGCCCACCGA CCCCGACCCC 
AAGTTCAACA TGTGGAAGAA CAACATGGTG 
TGGGACCAGA GCCTGAAGCC CTGCGTGAAG 
GCCGGCCCCA GCAGCAACAA CAGCAACGTG 
GAGATGAAGA ACTGCAGCTT CAACATGACC 
CACAGCCTGT TCTACAAGCT GGACATCGTG 
CGCCTGATCA ACTGCAACAC CAGCGCCATG 
CCCATCCCCA TCCACTACTG CGCCCCCGCC 
AACTTCACCG GCACCGGCCC CTGCAAGGAC 
AAGCCCGTGG TGAGCACCCA GCTGCTGCTG 
ATCCGCAGCG AGAACATCAC CGACAACGGC 
GTGAACATCA CCCGCATCCG CCCCGGCGAG 
GGCCAGACCT TCTACGCCAC CGGCGACGTG 
GTGAGCCGCG CCGCCTGGAA CAGCACCCTG 
TTCAACAACA AGACCATCAT CTTCAAGAAC 
CACAGCTTCA ACTGCGGCGG CGAGTTCTTC 
ACCTGGAACG AGCACGGCCC CGTGACCAAC 
CCCTGCCGCA TCAAGCAGAT CATCAACATG 
CCCCCCATCG AGGGCAACAT CCGCTGCGAG 
GACGGCGGCA GCGGCGCCAA CAGCAGCAAG 
CGCGACAACT GGCGCAGCGA GCTGTACAAG 
GTGGCCCCCA CCAAGGCCAA GCGCCGCGTG 
GGCGCCGCCT TCCTGGGCTT CCTGGGCGCC 
ACCCTGACCG TGCAGGCCCG CCAGCTGCTG 
CTGCGCGCCA TCGAGGCCCA GCAGCACCTG 
CTGCAGGCCC GCGTGCTGGC CGTGGAGCGC 
TGGGGCTGCA GCGGCAAGCT GATCTGCACC 
AACAAGAGCC AGAGCGAGAT CTGGGGCAAC 
ATCAACTACA CCAACATCAT CTACGACCTG 
AACGAGCAGG ACCTGCTGGC CCTGGACAAG 
AGCAACTGGC TGTGGTACAT CAAGATCTTC 
CGCATCGTGT TCGCCGTGCT GAGCATCATC 
AGCCTGCAGA CCCTGACCCC CCACCCCGAG 
GAGGGCGGCG AGCAGGACCG CGACCGCAGC 
GCCTGGGACG ACCTGCGCAG CCTGTGCCTG 
AGCATCGCCG CCCGCACCGT GGAGCTGCTG 
GGCTGGGAGG GCCTGAAGTA CCTGGGCAAC 
AGCAGCGCCA TCAACCTGAT CGACACCATC 
GTGATCGAGA TCGGCCAGCG CTTCTGCCGC 
GAGGGCGGCG AGCGCGCCCT GCAGTAA 



FIGURE 12 

gpl60mod.SE8538 

1 ATGCGCGTGA AGGGCATCCA 
61 CTGGGCATGA TCATCATCTG 
121 GTGCCCGTGT GGAAGGACGC 
181 GACACCGAGG TGCACAACGT 
241 CAGGAGCTGC ACCTGGCCAA 
301 GAGCAGATGC ACACCGACAT 
361 CTGACCCCCC TGTGCGTGAC 
421 AGCCACAGCT ACAACGTGAC 
481 ACCGAGCTGC GCGACAAGCG 
541 CCCATCGGCG GCAACGACAC 
601 GCCATCACCC AGGCCTGCCC 
661 CCCGCCGGCT TCGCCATCCT 
721 AAGAACGTGA GCACCGTGCA 
781 CTGCTGAACG GCAGCCTGGC 
841 AACGTGAAGA ACATCATCGT 
901 GGCAACAACA CCCGCAAGAG 
961 GAGGTGATCG GCGACATCCG 
1021 ACCCTGCACG AGGTGGCCAA 
1081 ACCAACAGCA GCGGCGGCGA 
1141 TTCTTCTACT GCAACACCAG 
1201 CCCATGAGCA ACAGCACCGA 
1261 ATCATCAACA TGTGGCAGCG 
1321 ATCAAGTGCG TGAGCAACAT 
1381 AGCACCAACG AGACCTTCCG 
1441 CTGTACAAGT ACAAGGTGGT 
1501 CGCCGCGTGG TGGAGCGCGA 
1561 CTGGGCGCCG CCGGCAGCAC 
1621 CAGCTGCTGA GCGGCATCGT 
1681 CT^GCACCTGC TGAAGCTGAC 
1741 GTGGAGCGCT ACCTGAAGGA 
1801 ATCTGCACCA CCAACGTGCC 
1861 TGGGACAACA TGACCTGGCT 
1921 TACCGCCTGA TCGAGGAGAG 
1981 CTGGACAAGT GGGCCAGCCT 
2041 CGCATCTTCA TCATGATCGT 
2101 AGCGTGATCA ACCGCGTGCG 
2161 AACCCCGGCG ACCTGGACCG 
2221 GGCCGCAGCA TCCGCCTGGT 
2281 CTGTGCCTGT TCAGCTACCA 
2341 GAGCTGCTGG GCCAGCGCGG 
2401 TGGATCCGCG AGCTGAAGAT 
2461 GCCGGCTGGA CCGACCGCGT 
2521 ATCCCCGTGC GCATCCGCCA 



GCGCAACAGC CAGGAGCTGC 
CAGCACCGCC GACAAGCTGT 
CGAGACCACC CTGTTCTGCG 
GTGGGCCACC CACGCCTGCG 
CGTGACCGAG GAGTTCAACA 
CATCAGCCTG TGGGACCAGA 
CCTGGAGTGC AACGACTACA 
CAACATGCAG GAGATGAAGA 
CCAGAAGGTG ACCAGCCTGT 
CAACAGCACC CAGTACCGCC 
CAAGGTGACC TTCGAGCCCA 
GAAGTGCCGC GACGAGAACT 
GTGCACCCAC GGCATCAAGC 
CCGCGAGAAG GTGATGATCC 
GCAGCTGAAG GAGCCCGTGG 
CATCCGCATC GGCCCCGGCC 
CCAGGCCCAC TGCAACGTGA 
GCAGCTGCGC ACCTACTTCA 
CCTGGAGATC ACCACCCACA 
CGGCCTGTTC AACAGCACCT 
GAGCAACGAC ACCATCACCC 
CGCCGGCAAG GCCATCTACG 
CACCGGCCTG ATCCTGACCC 
CCCCGGCGGC GGCGACATGC 
GAAGATCGAG CCCCTGGGCG 
GAAGCGCGCC ATCGGCATCG 
CATGGGCGCC GCCAGCATCA 
GCAGCAGCAG AGCAACCTGC 
CGTGTGGGGC ATCAAGCAGC 
CCAGCAGCTG CTGGGCATCT 
CTGGAACAGC AGCTGGAGCA 
GCAGTGGGAC AAGGAGATCA 
CCAGAACCAG CAGGAGAAGA 
GTGGAACTGG TTCGACATCA 
GGGCGGCCTG ATCGGCCTGC 
CCAGGGCTAC AGCCCCCTGA 
CCCCGGCCGC ATCGAGGAGG 
GAGCGGCTTC CTGGCCCTGG 
CCGCCTGCGC GACTTCATCC 
CTGGGAGGGC CTGAAGTACC 
CAGCGCCATC AGCCTGCTGG 
GATCGAGCTG GGCCAGCGCC 
GGGCTTCGAG CGCGCCCTGC 



TGCGCTGGGG CACCATGATC 
GGGTGACCGT GTACTACGGC 
CCAGCGACGC CAAGGCCTAC 
TGCCCACCGA CCCCAACCCC 
TGTGGAAGAA CAGCATGGTG 
GCCTGATCCC CTGCGTGAAG 
ACTACAACGT GACCAACAGC 
ACTGCAGCTT CAACGTGACC 
TCTACAAGCT GGACGTGGTG 
TGATCAACTG CAACACCAGC 
TCCCCATCCA CTACTGCGCC 
TCAACGGCAC CGGCCCCTGC 
CCGTGGTGAG CACCCAGCTG 
GCAGCGAGAA CATCACCAAC 
AGATCAACTG CACCCGCCCC 
AGGCCTTCTA CGCCACCGGC 
GCCGCGCCAA GTGGAACAAG 
ACAACAAGAC CATCATCTTC 
CCGTGAACTG CGGCGGCGAG 
GGAGCAGCAA CGCCAGCGAG 
TGCAGTGCCG CATCCGCCAG 
CCCCCCCCAT CCCCGGCATC 
GCGACGGCGG CAGCAACAAC 
GCGACAACTG GCGCAGCGAG 
TGGCCCCCAC CAAGGCCAAG 
GCGCCGTGTT CATCGGCTTC 
CCCTGACCGT GCAGGCCCGC 
TGCGCGCCAT CGAGGCCCAG 
TGCAGGCCCG CGTGCTGGCC 
GGGGCTGCAG CGGCAAGCTG 
ACAAGAGCCA GAGCGAGATC 
GCAACTACAC CCAGACCATC 
ACGAGCAGGA CGTGCTGGCC 
GCCGCTGGCT GTGGTACATC 
GCATCGTGTT CGCCGTGCTG 
GCTTCCAGAT CCACACCCCC 
AGGGCGGCGA GCAGGACCGC 
CCTGGGACGA CCTGCGCAGC 
TGATCGCCGC CCGCACCGTG 
TGTGGAACCT GCTGGTGTAC 
ACACCATCGC CATCGCCGTG 
TGTGCCGCGC CATCCTGCAC 
TGTAA 



FIGURE 13 

gpl60niod.UG031 



1 ATGCGCGTGC GCGGCATCCA GACCAGCTGG 
61 CTGGGCATGC TGATGATCTA CAGCGCCGCC 
121 GTGCCCGTGT GGAAGGACGC CGAGACCACC 
181 GACACCGAGG TGCACAACGT GTGGGCCACC 
241 CAGGAGATCC ACCTGGAGAA CGTGACCGAG 
301 GAGCAGATGC ACACCGACAT CATCAGCCTG 
361 CTGACCCCCC TGTGCGTGAC CCTGGACTGC 
421 AACGTGACCA ACGACATGGA GGGCGAGATG 
481 CTGAAGGACA AGAAGCAGCA GGTGTACAGC 
541 AACGAGAAGA ACAAGACCAA CAAGTACCGC 
601 CAGGCCTGCC CCAAGGTGAG CTTCGAGCCC 
661 TTCGCCATCC TGAAGTGCAA GGACACCGAG 
721 AGCACCGTGC AGTGCACCCA CGGCATCCGC 
781 GGCAGCCTGG CCGAGGGCGG CATCCAGATC 
841 ACCATCATCG TGCAGCTGGA CAAGGCCGTG 
901 ACCCGCAAGA GCGTGCGCAT CGGCCCCGGC 
961 GGCGACATCC GCCAGGCCCA CTGCAACGTG 
1021 GGCATCGCCA AGAAGCTGAG CGAGCACTTC 
1081 AGCGGCGGCG ACATCGAGAT CACCACCCAC 
1141 TGCAACACCA GCGGCCTGTT CAACGGCACC 
1201 ACCACCCCCA ACGACACCAT CACCCTGACC 
1261 CAGAAGGTGG GCCAGGCCAT GTACGCCCCC 
1321 AACATCACCG GCCTGCTGCT GACCCGCGAC 
1381 CGCCCCGGCG GCGGCAACAT GCGCGACAAC 
1441 GTGAAGATCG AGCCCCTGGG CGTGGCCCCC 
1501 GAGAAGCGCG CCGTGGGCAT CGGCGCCGTG 
1561 ACCATGGGCG CCGCCAGCAT CACCCTGACC 
1621 GTGCAGCAGC AGAGCAACCT GCTGCGCGCC 
1681 ACCGTGTGGG GCATCAAGCA GCTGCAGGCC 
1741 GACCAGCAGC TGCTGGGCAT CTGGGGCTGC 
1801 CCCTGGAACA GCAGCTGGAG CAACAAGAGC 
1861 CTGCAGTGGG AGAAGGAGAT CAGCAACTAC 
1921 AGCCAGAACC AGCAGGAGAA GAACGAGCAG 
1981 CTGTGGAACT GGTTCGACAT CAGCCGCTGG 
2041 GTGGGCGGCC TGATCGGCCT GCGCATCGTG 
2101 CGCCAGGGCT ACAGCCCCCT GAGCTTCCAG 
2161 CGCCTGGGCC GCATCGGCGA GGAGGGCGGC 
2221 GTGAGCGGCT TCCTGGCCCT GGCCTGGGAC 
2281 CACCGCCTGC GCGACTTCAT CAGCATCGCC 
2341 AGCCTGAAGG GCCTGCGCCT GGGCTGGGAG 
2401 TACTGGGGCC TGGAGCTGAA GACCAGCGCC 
2461 GTGGCCGGCT GGACCGACCG CGTGATCGAG 
2521 AACATCCCCC GCCGCATCCG CCAGGGCCTG 



CAGAACCTGT GGCGCTGGGG CACCATGATC 
GAGAACCTGT GGGTGACCGT GTACTACGGC 
CTGTTCTGCG CCAGCGACGC CAAGGCCTAC 
CACGCCTGCG TGCCCACCGA CCCCAACCCC 
GACTTCAACA TGTGGAAGAA CAACATGGTG 
TGGGACCAGA GCCTGAAGCC CTGCGTGGAG 
CTGAACGCCA CCCTGAACGC CACCGCCCCC 
AAGAACTGCA GCTACAACAT CACCACCGAG 
CTGTTCTACA AGCTGGACGT GGTGCAGATC 
CTGATCAACT GCAACACCAG CGCCATCACC 
ATCCCCATCC ACTACTGCGC CCCCGCCGGC 
TTCAACGGCA CCGGCCCCTG CAAGAACGTG 
CCCGTGATCA GCACCCAGCT GCTGCTGAAC 
CGCAGCGAGA ACATCACCAA CAACGCCAAG 
AAGATCAACT GCACCCGCCC CAACAAC7UVC 
CAGGCCTTCT ACGCCACCGG CGACATCATC 
AGCCGCGCCA AGTGGAACGA GACCCTGCGC 
AAGAACAAGA TCATCATCTT CGAGAAGAGC 
AGCTTCAACT GCGGCGGCGA GTTCTTCTAC 
TGGAAGCCCA ACAGCACCGA GAGCAACAAC 
TGCCGCATCA AGCAGATCAT CAACATGTGG 
CCCATCCAGG GCGTGATCCG CTGCGAGAGC 
GGCGGCATCA ACAGCATCAA CGAGACCTTC 
TGGCGCAGCG AGCTGTACAA GTACAAGGTG 
AGCCGCGCCA AGCGCCGCGT GGTGGAGCGC 
TTCCTGGGCT TCCTGGGCGC CGCCGGCAGC 
GCCCAGGCCC GCCAGCTGCT GAGCGGCATC 
ATCAAGGCCC AGCAGCACAT GCTGAAGCTG 
CGCGTGCTGG CCGTGGAGCG CTACCTGAAG 
AGCGGCAAGC TGATCTGCAC CACCAACGTG 
ATGAACGAGA TCTGGGACAA CATGACCTGG 
ACCCAGCTGA TCTACAACCT GATCGAGGAG 
GACCTGCTGG CCCTGGACAA GTGGGCCAGC 
CTGTGGTACA TCAAGATCTT CATCATGATC 
TTCGCCGTGC TGAGCGTGAT CAACCGCGTG 
ATCCGCACCC CCAACCCCGA GGAGCCCGAC 
GAGCAGGACC GCGACCGCAG CATCCGCCTG 
GACCTGCGCA GCCTGTGCCT GTTCAGCTAC 
GCCCGCACCG TGGAGCTGCT GGGCCACAGC 
GGCCTGAAGT ACCTGTGGAA CCTGCTGCTG 
GTGAACCTGG TGGACACCAT CGCCATCGCC 
ATCGGCCAGC GCATCTTCCG CGCCATCCTG 
GAGCGCGGCC TGCTGTAA 



FIGtJRE 14 

gpl60mod.92UG001 



1 ATGCGCGTGC GCGAGATCGA GCGCAACTAC 
61 CTGGGCATGC TGATGACCTA CAGCGTGGCC 
121 GTGCCCGTGT GGAAGGAGGC CACCACCACC 
181 AAGACCGAGG TGCACAACAT CTGGGCCACC 
241 CGCGAGATCG AGCTGGAGAA CGTGACCGAG 
301 GAGCAGATGC ACGAGGACAT CATCAGCCTG 
361 CTGACCCCCC TGTGCGTGAC CCTGAACT6C 
421 AACATCACCG GCATGGAGAA CAACGACCAG 
481 ACCACCAAGC TGATCGACAA GAAGAAGCAG 
541 GTGCAGATCG ACAACGACAC CAGCAACAGC 
601 AACACCAGCG CCATCACCCA GGCCTGCCCC 
661 TACTGCGCCC CCGCCGGCTT CGCCATCCTG 
721 GGCCCCTGCA AGAACGTGAG CACCGTGCAG 
781 ACCCAGCTGC TGCTGAACGG CAGCGTGGCC 
841 CTGACCAACA ACGCCAAGAC CCTGATCGTG 
901 ACCCGCCCCT ACTACAACCA GATCCGCCAG 
961 TACACCACCC GCGTGACCGG CGACATCCGC 
1021 TGGAACAAGA CCCTGCAGCA GGTGGCCAAG 
1081 ATCATCTTCA AGCCCAGCAG CGGCGGCGAC 
1141 GGCGGCGAGT TCTTCTACTG CAACACCAGC 
1201 ACCTGGAACA TCGGCAACAA CAACACCGGC 
1261 CGCATCAAGC AGATCATCAA CATGTGGCAG 
1321 ATCGAGGGCT GGATCAACTG CGCCAGCAAC 
1381 GGCGGCGCCA ACGACAGCCA GAACGAGACC 
1441 AACTGGCGCA GCGAGCTGTA CAAGTACAAG 
1501 CCCACCAAGG CCAAGCGCCG CGTGGTGGAG 
1561 ATGTTCCTGG GCTTCCTGGG CGCCGCCGGC 
1621 ACCGTGCAGG CCCGCCAGCT GCTGAGCGGC 
1681 GCCATCGAGG CCCAGCAGCA CCTGCTGCAG 
1741 GCCCGCATCC TGGCCGTGGA GCGCTACCTG 
1801 TGCAGCGGCC GCCACATCTG CACCACCACC 
1861 AGCATCGACG ACATCTGGAA CAACATGACC 
1921 TACACCGGCG TGATCTACCG CCTGATCGAG 
1981 CAGGAGCTGC TGCAGCTGGA CAAGTGGGCC 
2041 TGGCTGTGGT ACATCAAGAT CTTCATCATG 
2101 GTGTTCACCG TGCTGAGCCT GGTGAACCGC 
2161 CAGACCCTGT TCCCCGCCCC CCGCGGCCCC 
2221 GGCGAGCAGG GCCGCGGCCG CAGCACCCGC 
2281 GACGACCTGC GCAACCTGTG CCTGTTCAGC 
2341 GCCACCCGCA TCGTGGAGCT GCTGGGCCGC 
2401 AACCTGCTGC AGTACTGGAG CCAGGAGCTG 
2461 ACCGCCGTGG CCGTGGCCGA GGGCACCGAC 
2521 CGCGGCATCC TGAACGTGCC CACCCGCATC 



CTGTGCCTGT GGCGCTGGGG CATCATGCTG 
GAGAAGAAGT GGGTGACCGT GTACTACGGC 
CTGTTCTGCG CCAGCGACGC CAAGAGCTAC 
CACGCCTGCG TGCCCACCGA CCCCAACCCC 
AACTTCAACA TGTGGAAGAA CAACATGGTG 
TGGGACCAGA GCCTGAAGCC CTGCGTGAAG 
ACCGACGCCC GCCGCAACGA GACCCGCAAC 
ATCGAGATGA AGAACTGCAG CTTCAACATC 
GTGCACGCCC TGTTCTACCG CCTGGACGTG 
AACTACAGCA ACTACCGCCT GATCAACTGC 
AAGGTGACCT TCGAGCCCAT CCCCATCCAC 
AAGTGCCGCG ACAAGAAGTT CAACGGCACC 
TGCACCCACG GCATCCGCCC CGTGGTGAGC 
GAGGAGGAGA TCATCATCCG CAGCGAGAAC 
CAGCTGAACG AGAGCGTGGA GATCAACTGC 
CGCACCAGCA TCGGCCAGGG CCAGGCCCTG 
AAGGCCTACT GCAACATCAG CAAGGCCGGC 
AAGCTGGGCG ACCTGTTCAA CCAGACCACC 
CCCGAGATCA CCACCCACAG CTTCAACTGC 
AAGCTGTTCA ACAGCGCCTG GAACGACAGC 
AGCGACAACG AGACCATCAT CATCCCCTGC 
GGCGTGGGCA AGGCCATGTA CGCCCCCCCC 
ATCACCGGCC TGCTGCTGGT GCGCGACGGC 
TTCCGCCCCC AGGGCGGCGA CATGCGCGAC 
GTGGTGAAGA TCGAGCCCCT GGGCATCGCC 
CGCGAGAAGC GCGCCATCGG CCTGGGCGCC 
AGCACCATGG GCGCCGCCAG CCTGACCCTG 
ATCGTGCAGC ACCAGAACAA CCTGCTGATG 
CTGACCGTGT GGGGCATCAA GCAGCTGCAG 
CAGGACCAGC AGCTGCTGGG CAGCTGGGGC 
GTGCCCTGGA ACAGCAGCTG GAGCAACAAG 
TGGATGGAGT GGGAGAAGGA GATCGACAAC 
GAGAGCCAGA CCCAGCAGGA GAAGAACGAG 
AGCCTGTGGA ACTGGTTCAG CATCACCAAG 
ATCGTGGGCG GCCTGATCGG CCTGCGCATC 
GTGCGCCAGG GCTACAGCCC CCTGAGCTTC 
GACCGCCCCG AGGAGATCGA GGAGGGCGGC 
CTGGTGAACG GCTTCAGCAC CCTGATCTGG 
TACCACCGCC TGCGCGACCT GATCCTGATC 
CGCGGCTGGG AGGCCATCAA GTACCTGTGG 
AAGACCAGCG CCATCAGCCT GTTCAACGCC 
CGCGTGATCG AGGTGGTGCA GCGCTTCTTC 
CGCCAGGGCC TGGAGCGCGC CCTGCTGTAA 



FIGURE 15 

gpl60mod.94UG114 



1 ATGCGCGTGC GCGAGACCAA GCGCAACTAC CAGCACCTGT 
61 CTGGGCATGC TGATGATCTG CAGCGTGACC GGCAAGAGCT 
121 GTGCCCGTGT GGAAGGAGGC CACCACCACC CTGTTCTGCG 
181 AAGGCCGAGG CCCACAACAT CTGGGCCACC CACGCCTGCG 
241 CAGGAGATCA AGCTGGAGAA CGTGACCGAG AACTTCAACA 
301 GAGCAGATGC ACGAGGACAT CATCAGCCTG TGGGACCAGA 
361 CTGACCCCCC TGTGCGTGAC CCTGAACTGC ACCAACTGGG 
421 ACCGGCATGG CCAACTGCAG CTTCAACATC ACCACCGAGA 
481 GTGCAGGCCC TGTTCTACAA GCTGGACGTG GTGAAGATCA 
541 ACCAGCTACC GCCTGATCAA CTGCAACACC AGCGCCATCA 
601 ACCTTCGAGC CCATCCCCAT CCACTACTGC GCCCCCGCCG 
661 AACGAGAAGA AGTTCAACGG CACCGGCCCC TGCAAGAACG 
721 CACGGCATCA AGCCCGTGGT GAGCACCCAG CTGCTGCTGA 
781 GAGATCATCA TCCGCAGCGA GAACCTGACC AACAACGCCA 
841 AACGAGAGCG TGCCCATCAA CTGCATCCGC CCCTACAACA 
901 ATCGGCCCCG GCCAGGCCCT GTTCACCACC AAGGTGATCG 
961 TGCAACATCA GCGGCGCCGG CTGGAACAAG ACCCTGCAGC 
1021 AACCTGCTGA ACCAGACCAC CATCATCTTC AAGCCCAGCA 
1081 ACCACCCACA GCTTCAACTG CGGCGGCGAG TTCTTCTACT 
1141 AACAGCACCT GGAAGCGCAA CAACAGCGAG TGGCGCAGCG 
1201 ATCACCCTGC AGTGCCGCAT CAAGCAGATC ATCAACATGT 
1261 ATGTACGCCC CCCCCATCGA GGGCTTCATC AACTGCAGCA 
1321 CTGACCCGCG ACGGCGGCGC CATCAACAGC AGCCA6AACG 
1381 GGCGACATGC GCAACAACTG 6CGCAGCGAG CTGTACAAGT 
1441 CCCATCGGCC TGGCCCCCAC CGCCGCCAAG CGCCGCGTGG 
1501 ATCGGCCTGG GCGCCCTGTT CCTGGGCTTC CTGGGCACCG 
1561 GTGAGCCTGA CCCTGACCGT GCAGGCCCGC CAGGTGCTGA 
1621 AACAACCTGC TGCGCGCCAT CGAGGCCCAG CAGCACCTGC 
1681 ATCAAGCAGC TGCAGGCCCG CATCCTGGCC GTGGAGAGCT 
1741 CTGGGCATCT GGGGCTGCAG CGGCAAGCAC ATCTGCACCA 
1801 AGCTGGAGCA ACCGCAGCGT GGACGAGATC TGGAACAACA 
1861 CGCGAGATCG ACAACTACAC CGAGCTGGTG TACAGCCTGC 
1921 CAGGAGAAGA ACGAGCAGGA GCTGCTGAAG CTGGACACCT 
1981 TTCAGCATCA CCCAGTGGCT GTGGTACATC AAGATCTTCA 
2041 ATCGGCCTGG GCATCGTGTT CGCCGTGCTG AGCGTGGTGA 
2101 AGCCCCCTGA GCTTCCAGAC CCTGCTGCCC GCCCCCCGCG 
2161 ATCGAGGAGG AGGGCGGCGA GCGCGACCGC GGCCGCAGCA 
2221 AGCGCCCTGA TCTGGGACGA CCTGCGCAAC CTGTGCCTGT 
2281 GACCTGATCC TGATCGCCGC CCGCATCGTG GAGCTGCTGG 
2341 ATCAAGTACC TGTGGAACCT GCTGCAGTAC TGGATCCAGG 
2401 AGCCTGTTCA ACACCATCGC CATCGCCGTG GCCGAGGGCA 
2461 GTGCAGCGCG CCGTGCGCGC CATCCTGAAC ATCCCCGTGC 
2521 CGCGCCCTGC TGTAA 



GGAAGTGGGG CACCATGCTG 
GGGTGACCGT GTACTACGGC 
CCAGCGACGC CAAGGCCTAC 
TGCCCACCGA CCCCAACCCC 
TGTGGAAGAA CAACATGGTG 
GCCTGAAGCC CTGCGTGAAG 
TGACCGACAC CACCAACACC 
TCCGCGACAA GAAGAAGCAG 
ACGACAACGA CAGCGACAAC 
CCCAGGCCTG CCCCAAGATG 
GCTTCGCCAT CCTGAAGTGC 
TGAGCACCGT GCAGTGCACC 
ACGGCAGCCT GGCCGAGGAG 
AGATCATCAT CGTGCAGCTG 
ACACCCGCCA GAGCACCCGC 
GCGACATCCG CCAGGCCCAC 
AGGTGGCCGA GAAGCTGGGC 
GCGGCGGCGA CCCCGAGATC 
GCAACACCAC CCGCCTGTTC 
ACAACACCCC CGACGAGACC 
GGCAGGAGGT GGGCAAGGCC 
GCAACATCAC CGGCCTGCTG 
AGACCTTCCG CCCCGGCGGC 
ACAAGGTGGT GAAGCTGGAG 
TGGAGCGCGA GAAGCGCGCC 
CCGGCAGCAC CATGGGCGCC 
GCGGCATCGT GCAGCAGCAG 
TGCAGCTGAC CGTGTGGGGC 
ACCTGAAGGA CCAGCAGCTG 
CCAACGTGCC CTGGAACAGC 
TGACCTGGAT GGAGTGGGAG 
TGGAGGTGAG CCAGATCCAG 
GGGCCAGCCT GTGGAACTGG 
TCATGATCGT GGGCGGCCTG 
ACCGCGTGCG CCAGGGCTAC 
AGCCCGACCG CCCCGAGGGC 
TCCGCCTGGT GAACGGCCTG 
TCAGCTACCA CCGCCTGCGC 
GCCGCCGCGG CTGGGAGGCC 
AGCTGAAGAA CAGCGCCGTG 
CCGACCGCGC CATCGAGCTG 
GCATCCGCCA GGGCCTGGAG 



FIGURE 16 

gpieOmod.ELI 



1 ATGCGCGCCC GCGGCATCGA GCGCAACTGC 
61 CTGGGCATCC TGATGACCTG CAGCGCCGCC 
121 GTGCCCGTGT GGAAGGAGGC CACCACCACC 
181 GAGACCGAGG CCCACAACAT CTGGGCCACC 
241 CAGGAGATCG CCCTGGAGAA CGTGACCGAG 
301 GAGCAGATGC ACGAGGACAT CATCAGCCTG 
361 CTGACCCCCC TGTGCGTGAC CCTGAACTGC 
421 GGCAACAACG TGACCACCGA GGAGAAGGGC 
481 GTGCTGAAGG ACAAGAAGCA GCAGGTGTAC 
541 ATCGACAACG ACAGCAGCAC CAACAGCACC 
601 GCCATCACCC AGGCCTGCCC CAAGGTGAGC 
661 CCCGCCGGCT TCGCCATCCT GAAGTGCCGC 
721 ACCAACGTGA GCACCGTGCA GTGCACCCAC 
781 CTGCTGAACG GCAGCCTGGC CGAGGAGGAG 
841 AACGCCAAGA ACATCATCGC CCACCTGAAC 
901 TACCAGAACA CCCGCCAGCG CACCCCCATC 
961 AGCCGCAGCA TCATCGGCCA GGCCCACTGC 
1021 CTGCAGCAGG TGGCCCGCAA GCTGGGCACC 
1081 CCCAGCAGCG GCGGCGACCC CGAGATCACC 
1141 TTCTACTGCA ACACCAGCGG CCTGTTCAAC 
1201 ATCACCGAGA GCAACAACAG CACCAACACC 
1261 ATCATCAAGA TGGTGGCCGG CCGCAAGGCC 
1321 CTGTGCAGCA GCAACATCAC CGGCCTGCTG 
1381 ACCAACGAGA CCTTCCGCCC CGGCGGCGGC 
1441 TACAAGTACA AGGTGGTGCA GATCGAGCCC 
1501 CGCGTGGTGG AGCGCGAGAA GCGCGCCATC 
1561 GGCGCCGCCG GCAGCACCAT GGGCGCCCGC 
1621 CTGATGAGCG GCATCGTGCA GCAGCAGAAC 
1681 CACCTGCTGC AGCTGACCGT GTGGGGCATC 
1741 GAGCGCTACC TGAAGGACCA GCAGCTGCTG 
1801 TGCACCACCA ACGTGCCCTG GAACAGCAGC 
1861 CAGAACATGA CCTGGATGGA GTGGGAGCGC 
1921 AGCCTGATCG AGGAGAGCCA GACCCAGCAG 
1981 GACAAGTGGG CCAGCCTGTG GAACTGGTTC 
2041 ATCTTCATCA TGATCATCGG CGGCCTGATC 
2101 CTGGTGAACC GCGTGCGCCA GGGCTACAGC 
2161 CCCCGCGGCC CCGACCGCCC CGAGGGCACC 
2221 CGCAGCGTGC GCCTGCTGAA CGGCTTCAGC 
2281 TGCCTGTTCA GCTACCACCG CCTGCGCGAC 
2341 CTGCTGGGCC GCCGCGGCTG GGACATCCTG 
2401 AGCCAGGAGC TGCGCAACAG CGCCAGCAGC 
2461 GAGGGCACCG ACCGCGTGAT CGAGATCATC 
2521 CCCCGCCGCA TCCGCCAGGG CCTGGAGCGC 



CAGAACTGGT GGAAGTGGGG CATCATGCTG 
GACAACCTGT GGGTGACCGT GTACTACGGC 
CTGTTCTGCG CCAGCGACGC CAAGAGCTAC 
CACGCCTGCG TGCCCACCGA CCCCAACCCC 
AACTTCAACA TGTGGAAGAA CAACATGGTG 
TGGGACCAGA GCCTGAAGCC CTGCGTGAAG 
AGCGACGAGC TGCGCAACAA CGGCACCATG 
ATGAAGAACT GCAGCTTCAA CGTGACCACC 
GCCCTGTTCT ACCGCCTGGA CATCGTGCCC 
AACTACCGCC TGATCAACTG CAACACCAGC 
TTCGAGCCCA TCCCCATCCA CTACTGCGCC 
GACAAGAAGT TCAACGGCAC CGGCCCCTGC 
GGCATCCGCC CCGTGGTGAG CACCCAGCTG 
GTGATCATCC GCAGCGAGAA CCTGACCAAC 
GAGAGCGTGA AGATCACCTG CGCCCGCCCC 
GGCCTGGGCC AGAGCCTGTA CACCACCCGC 
AACATCAGCC GCGCCCAGTG GAGCAAGACC 
CTGCTGAACA AGACCATCAT CAAGTTCAAG 
ACCCACAGCT TCAACTGCGG CGGCGAGTTC 
AGCACCTGGA ACATCAGCGC CTGGAACAAC 
AACATCACCC TGCAGTGCCG CATCAAGCAG 
ATCTACGCCC CCCCCATCGA GCGCAACATC 
CTGACCCGCG ACGGCGGCAT CAACAACAGC 
GACATGCGCG ACAACTGGCG CAGCGAGCTG 
CTGGGCGTGG CCCCCACCCG CGCCAAGCGC 
GGCCTGGGCG CCATGTTCCT GGGCTTCCTG 
AGCGTGACCC TGACCGTGCA GGCCCGCCAG 
AACCTGCTGC GCGCCATCGA GGCCCAGCAG 
AAGCAGCTGC AGGCCCGCAT CCTGGCCGTG 
GGCATCTGGG GCTGCAGCGG CAAGCACATC 
TGGAGCAACC GCAGCCTGAA CGAGATCTGG 
GAGATCGACA ACTACACCGG CCTGATCTAC 
GAGAAGAACG AGAAGGAGCT GCTGGAGCTG 
AGCATCACCC AGTGGCTGTG GTACATCAAG 
GGCCTGCGCA TCGTGTTCGC CGTGCTGAGC 
CCCCTGAGCT TCCAGACCCT GCTGCCCGCC 
GAGGAGGAGG GCGGCGAGCG CGGCCGCGAC 
GCCCTGATCT GGGACGACCT GCGCAGCCTG 
CTGATCCTGA TCGCCGTGCG CATCGTGGAG 
AAGTACCTGT GGAACCTGCT GCAGTACTGG 
CTGTTCGACG CCATCGCCAT CGCCGTGGCC 
CAGCGCGCCT GCCGCGCCGT GCTGAACATC 
AGCCTGCTGT AA 



FIGURE 17 

gpl60inod.93IN101 



1 ATGCGCGTGC GCGGCACCCT GCGCAACTAC CAGCAGTGGT GGATCTGGGG CGTGCTGGGC 
61 TTCTGGATGC TGATGATCTG CAACGGCGGC GGCAACCTGT GGGTGACCGT GTACTACGGC 
121 GTGCCCGTGT GGAAGGAGGC CAAGACCACC CTGCTGTGCG CCAGCGACGC CAAGGCCTAC 
181 GAGCGCGAGG TGCACAACGT GTGGGCCACC CACGCCTGCG TGCCCACCGA CCCCAACCCC 
241 CAGGAGATCG TGCTGGGCAA CGTGACCGAG AACTTCAACA TGTGGAAGAA CGACATGGTG 
301 GACCAGATGC ACGAGGACGT GATCAGCCTG TGGGACCAGA GCCTGAAGCC CTGCGTGAAG 
361 CTGACCCCCC TGTGCGTGAC CCTGGAGTGC CGCAACGTGA GCCGCAACGT GAGCAGCTAC 
421 AACACCTACA ACGGCAGCGT GGAGGAGATC AAGAACTGCA GCTTCAACGC CACCCCCGAG 
481 GTGCGCGACC GCAAGCAGCG CATGTACGCC CTGTTCTACG GCCTGGACAT CGTGCCCCTG 
541 AACAAGAAGA ACAGCAGCGA GAACAGCAGC GAGTACCGCC TGATCAACTG CAACACCAGC 
601 GCCATCACCC AGGCCTGCCC CAAGGTGACC TTCGACCCCA TCCCCATCCA CTACTGCGCC 
661 CCCGCCGGCT ACGCCATCCT GAAGTGCAAC AACAAGACCT TCAACGGCAC CGGCCCCTGC 
721 AACAACGTGA GCACCGTGCA GTGCACCCAC GGCATCAAGC CCGTGGTGAG CACCCAGCTG 
781 CTGCTGAACG GCAGCCTGGC CGAGGGCGAG ATCATCATCC GCAGCGAGAA CCTGACCAAC 
841 AACGTGAAGA CCATCATCGT GCACCTGAAC CAGAGCGTGG AGATCGTGTG CACCCGCCCC 
901 AACAACAACA CCCGCAAGAG CATCCGCATC GGCCCCGGCC AGACCTTCTA CGCCACCGGC 
961 GACATCATCG GCGACATCCG CCAGGCCCAC TGCAACATCA GCCGCGACAA GTGGAACGAG 
1021 ACCCTGCAGC GCGTGGGCAA GAAGCTGGCC GAGCACTTCC ACAACAAGAC CATCAAGTTC 
1081 GCCAGCAGCA GCGGCGGCGA CCTGGAGATC ACCACCCACA GCTTCAACTG CCGCGGCGAG 
1141 TTCTTCTACT GCAACACCAG CGGCCTGTTC AACGGCACCT ACATGCCCAC CTACATGCCC 
1201 AACGGCACCG AGAGCAACAG CAACAGCACC ATCACCATCC CCTGCCGCAT CAAGCAGATC 
1261 ATCAACATGT GGCAGGAGGT GGGCCGCGCC ATGTACGCCC CCCCCATCGC CGGCAACATC 
1321 ACCTGCACCA GCAACATCAC CGGCCTGCTG CTGGTGCACG ACGGCGGCAT CAAGGAGAAC 
1381 GACACCGAGA ACAAGACCGA GATCTTCCGC CCCGGCGGCG GCGACATGCG CGACAACTGG 
1441 CGCAGCGAGC TGTACAflGTA CAAGGTGGTG GAGATCAAGC CCCTGGGCGT GGCCCCCACC 
1501 GCCGCCAAGC GCCGCGTGGT GGAGCGCGAG AAGCGCGCCG TGGGCATCGG CGCCGTGTTC 
1561 CTGGGCTTCC TGGGCGCCGC CGGCAGCACC ATGGGCGCCG CCAGCATCAC CCTGACCGCC 
1621 CAGGCCCGCC AGCTGCTGAG CGGCATCGTG CAGCAGCAGA GCAACCTGCT GCGCGCCATC 
1681 GAGGCCCAGC AGCACCTGCT GCAGCTGACC GTGTGGGGCA TCAAGCAGCT GCAGACCCGC 
1741 GTGCTGGCCA TCGAGCGCTA CCTGAAGGAC CAGCAGCTGC TGGGCATCTG GGGCTGCAGC 
1801 GGCAAGCTGA TCTGCACCAC CGCCGTGCCC TGGAACAGCA GCTGGAGCAA CAAGACCCAG 
1861 AGCGAGATCT GGAACAACAT GACCTGGATG CAGTGGGACC GCGAGGTGAG CAACTACACC 
1921 AACATCATCT ACAGCCTGCT GGAGGAGAGC CAGAACCAGC AGGAGAAGAA CGAGAAGGAC 
1981 CTGCTGGCCC TGGACAGCTG GAAGAACCTG TGGAGCTGGT TCGACATCAC CAACTGGCTG 
2041 TGGTACATCA AGATCTTCAT CATGATCGTG GGCGGCCTGA TCGGCCTGCG CATCATCTTC 
2101 GCCGTGCTGA GCATCGTGAA CCGCGTGC6C CAGGGCTACA GCCCCCTGAG CTTCCAGACC 
2161 CTGACCCCCA ACCCCCGCGG CCCCGACCGC CTGGGCCGCA TCGAGGAGGA GGGCGGCGAG 
2221 CAGGACAAGG ACCGCAGCAT CCGCCTGGTG AACGGCTTCC TGGCCCTGGC CTGGGACGAC 
2281 CTGCGCAACC TGTGCCTGTT CAGCTACCAC CGCCTGCGCG ACTTCATCAG CGTGGCCGCC 
2341 CGCGTGGTGG AGCTGCTGGG CCGCAGCAGC TGGGAGGCCC TGAAGTACCT GGGCAGCCTG 
2401 GTGCAGTACT GGGGCCTGGA GCTGAAGAAG AGCGCCATCA GCCTGTTCGA CAGCATCGCC 
2461 ATCGTGGTGG CCGAGGGCAC CGACCGCATC ATCGAGCTGG TGCAGGGCTT CTGCCGCGCC 
2521 ATCCGCAACA TCCCCACCCG CATCCGCCAG GGCTTCGAGG CCGCCCTGCA GTAA 



FIGURE 18 

gp 1 60mod.cm235. V3con 



1 ATGGATGCAA TGAAGAGAGG GCTCTGCTGT 
61 TCGCCCAGCG CTAGCAACAA CCTGTGGGTG 
121 GACGCCGACA CCACCCTGTT CTGCGCCAGC 
181 AACGTGTGGG CCACCCACGC CTGCGTGCCC 
241 GAGAACGTGA CCGAGAACTT CAACATGTGG 
301 GACGTGATCA GCCTGTGGGA CCAGAGCCTG 
361 GTGACCCTGA ACTGCACCAA CGCCAAGCTG 
421 AACACCATCG GCAACATCAC CGACGAGGTG 
481 CTGCGCGACA AGAAGCAGAA GGTGCACGCC 
541 GAGGACAACA AGACCAGCAG CGAGTACCGC 
601 CAGGCCTGCC CCAAGATCAG CTTCGACCCC 
661 TACGCCATCC TGAAGTGCAA CGACAAGAAC 
721 AGCAGCGTGC AGTGCACCCA CGGCATCAAG 
781 GGCAGCCTGG CCGAGGAGGA GATCATCATC 
841 ACCATCATCG TGCACCTGAA CAAGAGCGTG 
901 ACCCGCACCA GCATCACCAT CGGCCCCGGC 
961 GGCGACATCC GCAAGGCCTA CTGCGAGATC 
1021 CAGGTGACCG AGAAGCTGAA GGAGCACTTC 
1081 AGCGGCGGCG ACCTGGAGAT CACCATGCAC 
1141 TGCAACACCA CCCGCCTGTT CAACAACACC 
1201 AACGGCACCA TCATCCTGCC CTGCAAGATC 
1261 GGCCAGGCCA TGTACGCCCC CCCCATCAGC 
1321 GGCATCCTGC TGACCCGCGA CGGCGGCGCC 
1381 GGCGGCGGCA ACATCAAGGA CAACTGGCGC 
1441 ATCGAGCCCC TGGGCATCGC CCCCACCCGC 
1501 CGCGCCGTGG GCATCGGCGC CATGATCTTC 
1561 GGCGCCGCCA GCATCACCCT GACCGTGCAG 
1621 CAGCAGAGCA ACCTGCTGCG CGCCATCGAG 
1681 TGGGGCATCA AGCAGCTGCA GGCCCGCGTG 
1741 AAGTTCCTGG GCCTGTG6GG CTGCAGCGGC 
1801 AACAGCACCT GGAGCAACCG CAGCTACGAG 
1861 TGGGAGCGCG AGATCAGCAA CTACACCAAC 
1921 AACCAGCAGG ACCGCAACGA GAAGGACCTG 
1981 AACTGGTTCG ACATCACCAA GTGGCTGTGG 
2041 GGCCTGATCG GCCTGCGCAT CATCTTCGCC 
2101 GGCTACAGCC CCCTGAGCTT CCAGACCCCC 
2161 GAGCGCATCG AGGAGGGCGG CGGCGAGCAG 
2221 GGCTTCCTGG CCCTGGCCTG GGACGACCTG 
2281 CTGCGCGACT TCATCCTGAT CGCCGCCCGC 
2341 AAGGGCCTGC GCCGCGGCTG GGAGGGCCTG 
2401 GGCCAGGAGC TGAAGATCAG CGCCATCAGC 
2461 GGCTGGACCG ACCGCGTGAT CGAGGTGGCC 
2521 CCCCGCCGCA TCCGCCAGGG CCTGGAGCGC 



GTGCTGCTGC TGTGTGGAGC AGTCTTCGTT 
ACCGTGTACT ACGGCGTGCC CGTGTGGCGC 
GACGCCAAGG CCCACGAGAC CGAGGTGCAC 
ACCGACCCCA ACCCCCAGGA GATCCACCTG 
AAGAACAACA TGGTGGAGCA GATGCAGGAG 
AAGCCCTGCG TGAAGCTGAC CCCCCTGTGC 
ACCAACGTGA ACAACATCAC CAGCGTGAGC 
CGCAACTGCA GCTTCAACAT GACCACCGAG 
CTGTTCTACA AGCTGGACAT CGTGCCCATC 
CTGATCAACT GCAACACCAG CGTGATCAAG 
ATCCCCATCC ACTACTGCAC CCCCGCCGGC 
TTCAACGGCA CCGGCCCCTG CAAGAACGTG 
CCCGTGGTGA GCACCCAGCT GCTGCTGAAC 
CGCAGCGAGA ACCTGACCAA CAACGCCAAG 
GAGATCAACT GCACCCGCCC CAGCAACAAC 
CAGGTGTTCT ACCGCACCGG CGACATCATC 
AACGGCACCA AGTGGAACGA GGTGCTGACC 
AACAACAAGA CCATCATCTT CCAGCCCCCC 
CACTTCAACT GCCGCGGCGA GTTCTTCTAC 
TGCATCGAGA ACGGCACCAT GGGCGGCTGC 
AAGCAGATCA TCAACATGTG GCAGGGCGCC 
GGCCGCATCA ACTGCGTGAG CAACATCACC 
ATCAACACCA CCAACGAGAC CTTCCGCCCC 
AGCGAGCTGT ACAAGTACAA GGTGGTGCAG 
GCCAAGCGCC GCGTGGTGGA GCGCGAGAAG 
GGCTTCCTGG GCGCCGCCGG CAGCACCATG 
GCCCGCCAGC TGCTGAGCGG CATCGTGCAG 
GCCCAGCAGC ACCTGCTGCA GCTGACCGTG 
CTGGCCGTGG AGCGCTACCT GAAGGACCAG 
AAGATCATCT GCACCACCGC CGTGCCCTGG 
GAGATCTGGA ACAACATGAC CTGGATCGAG 
CA6ATCTACG AGATCCTGAC CGAGAGCCAG 
CTGGAGCTGG ACAAGTGGGC CAGCCTGTGG 
TACATCAAGA TCTTCATCAT GATCATCGGC 
GTGCTGAGCA TCGTGAACCG CGTGCGCCAG 
TTCCACCACC AGCGCGAGCC CGACCGCAGC 
GGCCGCGACC GCAGCGTGCG CCTGGTGAGC 
CGCAGCCTGT GCCTGTTCAG CTACCACCGC 
ACCGTGAAGC TGCTGGGCCG CAGCAGCCTG 
AAGTACCTGG GCAACCTGCT GCTGTACTGG 
CTGCTGGACG CCACCGCCAT CATCGTGGCC 
CAGGGCGCCT GGCGCGCCAT CCTGCACATC 
ACCCTGCTGT AA 



FIGURE 19 

gpl60partialmod.cm235.V3 con 



1 ATGGATGCAA TGAAGAGAGG GCTCTGCTGT 
61 TCGCCCAGCG CTAGCAACAA CTTGTGGGTT 
121 GATGCAGATA CCACCCTATT TTGTGCATCA 
181 AATGTCTGGG CCACACATGC CTGTGTACCC 
241 GAAAATGTAA CAGAAAATTT TAACATGTGG 
301 GATGTAATCA GTTTATGGGA TCAAAGTCTA 
361 GTTACTTTAA ATTGTACCAA TGCTAAGTTG 
421 AACACAATAG GAAATATAAC AGATGAAGTA 
481 CTAAGAGATA AGAAGCAGAA GGTCCATGCA 
541 GAAGATAATA AGACTAGTAG TGAGTATAGG 
601 CAGGCTTGTC CAAAGATATC CTTTGATCCA 
661 TATGCGATTT TAAAGTGTAA TGATAAGAAT 
721 AGCTCAGTAC AATGCACACA TGGAATTAAG 
781 GGCAGTCTAG CAGAAGAAGA GATAATAATC 
841 ACCATAATAG TGCACCTTAA TAAATCTGTA 
901 ACAAGAACAA GTATAACTAT AGGACCAGGA 
961 GGAGATATAA GAAAAGCATA TTGTGAGATT 
1021 CAGGTAACTG AAAAATTAAA AGAGCACTTT 
1081 TCAGGAGGAG ATCTAGAAAT TACAATGCAT 
1141 TGCAATACAA CACGACTGTT TAATAATACT 
1201 AATGGCACTA TCATACTTCC ATGCAAGATA 
1261 GGACAAGCAA TGTATGCTCC TCCCATCAGT 
1321 GGAATACTAT TGACAAGAGA TGGTGGTGCT 
1381 GGCGGCGGCA ACATCAAGGA CAACTGGCGC 
1441 ATCGAGCCCC TGGGCATCGC CCCCACCCGC 
1501 CGCGCCGTGG GCATCGGCGC CATGATCTTC 
1561 GGCGCCGCCA GCATCACCCT GACCGTGCAG 
1621 CAGCAGAGCA ACCTGCTGCG CGCCATCGAG 
1681 TGGGGCATCA AGCAGCTGCA GGCCCGCGTG 
1741 AAGTTCCTGG GCCTGTGGGG CTGCAGCGGC 
1801 AACAGCACCT GGAGCAACCG CAGCTACGAG 
1861 TGGGAGCGCG AGATCAGCAA CTACACCAAC 
1921 AACCAGCAGG ACCGCAACGA GAAGGACCTG 
1981 AACTGGTTCG ACATCACCAA GTGGCTGTGG 
2041 GGTTTAATAG GTTTAAGGAT AATTTTTGCT 
2101 GGATACTCAC CTTTGTCTTT CCAGACCCCT 
2161 GAAAGAATCG AAGAAGGAGG TGGCGAGCAA 
2221 GGATTCTTAG CTCTTGCGTG GGACGATCTA 
2281 TTGAGAGACT TCATCTTGAT TGCAGCGAGG 
2341 AAGGGACTGA GACGGGGGTG GGAAGGTCTC 
2401 GGTCAGGAAC TAAAAATTAG CGCTATTTCT 
2461 GGGTGGACAG ATAGGGTTAT AGAAGTAGCA 
2521 CCTAGGAGAA TCAGACAGGG CTTAGAAAGG 



GTGCTGCTGC TGTGTGGAGC AGTCTTCGTT 
ACAGTTTATT ATGGGGTTCC TGTGTGGAGA 
GATGCCAAAG CACATGAGAC AGAAGTGCAC 
ACAGACCCCA ACCCACAAGA AATACACCTG 
AAAAATAACA TGGTAGAGCA GATGCAGGAG 
AAGCCATGTG TAAAGTTAAC TCCTCTCTGC 
ACCAATGTCA ATAACATAAC CAGTGTCTCT 
AGAAACTGTT CTTTTAATAT GACCACAGAA 
CTTTTTTATA AGCTTGATAT AGTACCAATT 
TTAATAAATT GTAATACTTC AGTCATTAAG 
ATTCCTATAC ATTATTGTAC TCCAGCTGGT 
TTCAATGGGA CAGGGCCATG TAAAAATGTC 
CCAGTGGTAT CAACTCAATT GCTGTTAAAT 
AGATCTGAAA ATCTCACAAA CAATGCCAAA 
GAAATCAATT GTACCAGACC CTCCAACAAT 
CAAGTATTCT ATAGAACAGG AGACATAATA 
AATGGAACAA AATGGAATGA AGTTTTAACA 
AATAATAAGA CAATAATCTT TCAACCACCC 
CATTTTAATT GTAGAGGGGA ATTTTTCTAT 
TGCATAGAAA ATGGAACCAT GGGGGGGTGT 
AAGCAAATTA TAAACATGTG GCAGGGAGCA 
GGAAGAATTA ATTGTGTATC AAATATTACA 
ATTAATACAA CTAATGAGAC CTTCCGCCCC 
AGCGAGCTGT ACAAGTACAA GGTGGTGCAG 
GCCAAGCGCC GCGTGGTGGA GCGCGAGAAG 
GGCTTCCTGG GCGCCGCCGG CAGCACCATG 
GCCCGCCAGC TGCTGAGCGG CATCGTGCAG 
GCCCAGCAGC ACCTGCTGCA GCTGACCGTG 
CTGGCCGTGG AGCGCTACCT GAAGGACCAG 
AAGATCATCT GCACCACCGC CGTGCCCTGG 
GAGATCTGGA ACAACATGAC CTGGATCGAG 
CAGATCTACG AGATCCTGAC CGAGAGCCAG 
CTGGAGCTGG ACAAGTGGGC CAGCCTGTGG 
TACATCAAAA TATTTATAAT GATAATAGGA 
GTGCTTTCTA TAGTGAATAG AGTTAGGCAG 
TTCCATCATC AGAGGGAACC CGACAGATCC 
GGCAGAGACA GATCCGTGCG ATTAGTGAGC 
CGGAGCCTGT GCCTCTTCAG CTACCACCGC 
ACTGTGAAAC TTCTGGGACG CAGCAGTCTC 
AAATATCTGG GGAATCTTCT GTTATATTGG 
TTGCTTGATG CTACAGCAAT AATAGTAGCG 
CAAGGAGCTT GGAGAGCCAT TCTCCACATA 
ACTTTGCTAT AA 



